1
|
Beals J, Hu H, Li X. A survey of experimental and computational identification of small proteins. Brief Bioinform 2024; 25:bbae345. [PMID: 39007598 PMCID: PMC11247407 DOI: 10.1093/bib/bbae345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 05/27/2024] [Accepted: 07/02/2024] [Indexed: 07/16/2024] Open
Abstract
Small proteins (SPs) are typically characterized as eukaryotic proteins shorter than 100 amino acids and prokaryotic proteins shorter than 50 amino acids. Historically, they were disregarded because of the arbitrary size thresholds to define proteins. However, recent research has revealed the existence of many SPs and their crucial roles. Despite this, the identification of SPs and the elucidation of their functions are still in their infancy. To pave the way for future SP studies, we briefly introduce the limitations and advancements in experimental techniques for SP identification. We then provide an overview of available computational tools for SP identification, their constraints, and their evaluation. Additionally, we highlight existing resources for SP research. This survey aims to initiate further exploration into SPs and encourage the development of more sophisticated computational tools for SP identification in prokaryotes and microbiomes.
Collapse
Affiliation(s)
- Joshua Beals
- Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| | - Haiyan Hu
- Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| | - Xiaoman Li
- Burnett School of Biomedical Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL 32816, United States
| |
Collapse
|
2
|
Aubel M, Buchel F, Heames B, Jones A, Honc O, Bornberg-Bauer E, Hlouchova K. High-throughput Selection of Human de novo-emerged sORFs with High Folding Potential. Genome Biol Evol 2024; 16:evae069. [PMID: 38597156 PMCID: PMC11024478 DOI: 10.1093/gbe/evae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/11/2024] [Accepted: 03/23/2024] [Indexed: 04/11/2024] Open
Abstract
De novo genes emerge from previously noncoding stretches of the genome. Their encoded de novo proteins are generally expected to be similar to random sequences and, accordingly, with no stable tertiary fold and high predicted disorder. However, structural properties of de novo proteins and whether they differ during the stages of emergence and fixation have not been studied in depth and rely heavily on predictions. Here we generated a library of short human putative de novo proteins of varying lengths and ages and sorted the candidates according to their structural compactness and disorder propensity. Using Förster resonance energy transfer combined with Fluorescence-activated cell sorting, we were able to screen the library for most compact protein structures, as well as most elongated and flexible structures. We find that compact de novo proteins are on average slightly shorter and contain lower predicted disorder than less compact ones. The predicted structures for most and least compact de novo proteins correspond to expectations in that they contain more secondary structure content or higher disorder content, respectively. Our experiments indicate that older de novo proteins have higher compactness and structural propensity compared with young ones. We discuss possible evolutionary scenarios and their implications underlying the age-dependencies of compactness and structural content of putative de novo proteins.
Collapse
Affiliation(s)
- Margaux Aubel
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Filip Buchel
- Department of Cell Biology, Faculty of Science, Charles University, Prague, Czech Republic
- Department of Biochemistry, Faculty of Science, Charles University, Prague, Czech Republic
| | - Brennen Heames
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Alun Jones
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Ondrej Honc
- Imaging Methods Core Facility, BIOCEV, Prague, Czech Republic
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
- Department of Protein Evolution, Max Planck-Institute for Biology Tuebingen, Tuebingen, Germany
| | - Klara Hlouchova
- Department of Cell Biology, Faculty of Science, Charles University, Prague, Czech Republic
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic
| |
Collapse
|
3
|
Valdivia-Francia F, Sendoel A. No country for old methods: New tools for studying microproteins. iScience 2024; 27:108972. [PMID: 38333695 PMCID: PMC10850755 DOI: 10.1016/j.isci.2024.108972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2024] Open
Abstract
Microproteins encoded by small open reading frames (sORFs) have emerged as a fascinating frontier in genomics. Traditionally overlooked due to their small size, recent technological advancements such as ribosome profiling, mass spectrometry-based strategies and advanced computational approaches have led to the annotation of more than 7000 sORFs in the human genome. Despite the vast progress, only a tiny portion of these microproteins have been characterized and an important challenge in the field lies in identifying functionally relevant microproteins and understanding their role in different cellular contexts. In this review, we explore the recent advancements in sORF research, focusing on the new methodologies and computational approaches that have facilitated their identification and functional characterization. Leveraging these new tools hold great promise for dissecting the diverse cellular roles of microproteins and will ultimately pave the way for understanding their role in the pathogenesis of diseases and identifying new therapeutic targets.
Collapse
Affiliation(s)
- Fabiola Valdivia-Francia
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
- Life Science Zurich Graduate School, Molecular Life Science Program, University of Zurich/ ETH Zurich, Schlieren-Zurich, Switzerland
| | - Ataman Sendoel
- University of Zurich, Institute for Regenerative Medicine (IREM), Wagistrasse 12, 8952 Schlieren-Zurich, Switzerland
| |
Collapse
|
4
|
Ferrareze PAG, Pereira E Costa RA, Thompson CE. Genomic characterization and molecular evolution of human monkeypox viruses. Arch Virol 2023; 168:278. [PMID: 37864757 DOI: 10.1007/s00705-023-05904-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 08/30/2023] [Indexed: 10/23/2023]
Abstract
Monkeypox virus is a member of the family Poxviridae, as are variola virus and vaccinia virus. It has a linear double-strand DNA genome approximately 197 kb long, containing ~190 non-overlapping ORFs. Comparison of members of the Central and West African clades shows the presence of unique genes that are associated with different disease presentations, depending on the strain. The last smallpox vaccination efforts ended in the mid-1980s, and there is concern about the recent spread of human monkeypox disease around the world. Almost 87,000 human monkeypox cases have been diagnosed in the world, of which more than 10,900 were in Brazil. The aim of this study was to evaluate the epidemiology and molecular evolution of hMpxV. From computational biology analysis of 640 hMpxV genomes from 1962 to 2022, synteny breaks and gene conservation were observed between Central and West clade genomes, and strains belonged with the 2022 outbreak assigned to the West African clade. Evidence was found for diversifying selective pressure at specific sites within protein coding sequences, acting on immunomodulatory processes. The existence of different sites under diversifying and purifying selection in paralog genes indicates adaptive mechanisms underlying the host-pathogen interaction of monkeypox virus in humans.
Collapse
Affiliation(s)
- Patrícia Aline Gröhs Ferrareze
- Graduate Program in Health Sciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), Porto Alegre, RS, Brazil
| | | | - Claudia Elizabeth Thompson
- Department of Pharmacosciences, Universidade Federal de Ciências da Saúde de Porto Alegre (UFCSPA), 245/200C Sarmento Leite St, Porto Alegre, RS, 90050-170, Brazil.
| |
Collapse
|
5
|
Hosea R, Hillary S, Wu S, Kasim V. Targeting Transcription Factor YY1 for Cancer Treatment: Current Strategies and Future Directions. Cancers (Basel) 2023; 15:3506. [PMID: 37444616 DOI: 10.3390/cancers15133506] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 06/28/2023] [Accepted: 07/03/2023] [Indexed: 07/15/2023] Open
Abstract
Cancer represents a significant and persistent global health burden, with its impact underscored by its prevalence and devastating consequences. Whereas numerous oncogenes could contribute to cancer development, a group of transcription factors (TFs) are overactive in the majority of tumors. Targeting these TFs may also combat the downstream oncogenes activated by the TFs, making them attractive potential targets for effective antitumor therapeutic strategy. One such TF is yin yang 1 (YY1), which plays crucial roles in the development and progression of various tumors. In preclinical studies, YY1 inhibition has shown efficacy in inhibiting tumor growth, promoting apoptosis, and sensitizing tumor cells to chemotherapy. Recent studies have also revealed the potential of combining YY1 inhibition with immunotherapy for enhanced antitumor effects. However, clinical translation of YY1-targeted therapy still faces challenges in drug specificity and delivery. This review provides an overview of YY1 biology, its role in tumor development and progression, as well as the strategies explored for YY1-targeted therapy, with a focus on their clinical implications, including those using small molecule inhibitors, RNA interference, and gene editing techniques. Finally, we discuss the challenges and current limitations of targeting YY1 and the need for further research in this area.
Collapse
Affiliation(s)
- Rendy Hosea
- Key Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400044, China
- The 111 Project Laboratory of Biomechanics and Tissue Repair, College of Bioengineering, Chongqing University, Chongqing 400044, China
| | - Sharon Hillary
- Key Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400044, China
- The 111 Project Laboratory of Biomechanics and Tissue Repair, College of Bioengineering, Chongqing University, Chongqing 400044, China
| | - Shourong Wu
- Key Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400044, China
- The 111 Project Laboratory of Biomechanics and Tissue Repair, College of Bioengineering, Chongqing University, Chongqing 400044, China
- Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing University, Chongqing 400030, China
| | - Vivi Kasim
- Key Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400044, China
- The 111 Project Laboratory of Biomechanics and Tissue Repair, College of Bioengineering, Chongqing University, Chongqing 400044, China
- Chongqing Key Laboratory of Translational Research for Cancer Metastasis and Individualized Treatment, Chongqing University Cancer Hospital, Chongqing University, Chongqing 400030, China
| |
Collapse
|
6
|
Inchingolo MA, Diman A, Adamczewski M, Humphreys T, Jaquier-Gubler P, Curran JA. TP53BP1, a dual-coding gene, uses promoter switching and translational reinitiation to express a smORF protein. iScience 2023; 26:106757. [PMID: 37216125 PMCID: PMC10193022 DOI: 10.1016/j.isci.2023.106757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 03/07/2023] [Accepted: 04/24/2023] [Indexed: 05/24/2023] Open
Abstract
The complexity of the metazoan proteome is significantly increased by the expression of small proteins (<100 aa) derived from smORFs within lncRNAs, uORFs, 3' UTRs and, reading frames overlapping the CDS. These smORF encoded proteins (SEPs) have diverse roles, ranging from the regulation of cellular physiological to essential developmental functions. We report the characterization of a new member of this protein family, SEP53BP1, derived from a small internal ORF that overlaps the CDS encoding 53BP1. Its expression is coupled to the utilization of an alternative, cell-type specific promoter coupled to translational reinitiation events mediated by a uORF in the alternative 5' TL of the mRNA. This uORF-mediated reinitiation at an internal ORF is also observed in zebrafish. Interactome studies indicate that the human SEP53BP1 associates with components of the protein turnover pathway including the proteasome, and the TRiC/CCT chaperonin complex, suggesting that it may play a role in cellular proteostasis.
Collapse
Affiliation(s)
- Marta A. Inchingolo
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Aurélie Diman
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Maxime Adamczewski
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Faculté de Médecine et Pharmacie, Université Grenoble Alpes, Grenoble, France
| | - Tom Humphreys
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Pascale Jaquier-Gubler
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Joseph A. Curran
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
- Institute of Genetics and Genomics of Geneva (iGE3), University of Geneva, Geneva, Switzerland
| |
Collapse
|
7
|
Pueyo JI, Salazar J, Grincho C, Berni J, Towler BP, Newbury SF. Purriato is a conserved small open reading frame gene that interacts with the CASA pathway to regulate muscle homeostasis and epithelial tissue growth in Drosophila. Front Cell Dev Biol 2023; 11:1117454. [PMID: 36968202 PMCID: PMC10036370 DOI: 10.3389/fcell.2023.1117454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 02/24/2023] [Indexed: 03/12/2023] Open
Abstract
Recent advances in proteogenomic techniques and bioinformatic pipelines have permitted the detection of thousands of translated small Open Reading Frames (smORFs), which contain less than 100 codons, in eukaryotic genomes. Hundreds of these actively translated smORFs display conserved sequence, structure and evolutionary signatures indicating that the translated peptides could fulfil important biological roles. Despite their abundance, only tens of smORF genes have been fully characterised; these act mainly as regulators of canonical proteins involved in essential cellular processes. Importantly, some of these smORFs display conserved functions with their mutations being associated with pathogenesis. Thus, investigating smORF roles in Drosophila will not only expand our understanding of their functions but it may have an impact in human health. Here we describe the function of a novel and essential Drosophila smORF gene named purriato (prto). prto belongs to an ancient gene family whose members have expanded throughout the Protostomia clade. prto encodes a transmembrane peptide which is localized in endo-lysosomes and perinuclear and plasma membranes. prto is dynamically expressed in mesodermal tissues and imaginal discs. Targeted prto knockdown (KD) in these organs results in changes in nuclear morphology and endo-lysosomal distributions correlating with the loss of sarcomeric homeostasis in muscles and reduction of mitosis in wing discs. Consequently, prto KD mutants display severe reduction of motility, and shorter wings. Finally, our genetic interaction experiments show that prto function is closely associated to the CASA pathway, a conserved mechanism involved in turnover of mis-folded proteins and linked to muscle dystrophies and neurodegenerative diseases. Thus, this study shows the relevance of smORFs in regulating important cellular functions and supports the systematic characterisation of this class of genes to understand their functions and evolution.
Collapse
Affiliation(s)
- Jose I. Pueyo
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Jorge Salazar
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Carolina Grincho
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Jimena Berni
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| | - Benjamin P. Towler
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
- Department of Biochemistry and Biomedicine, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Sarah F. Newbury
- Brighton and Sussex Medical School, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
8
|
Kushwaha AK, Dwivedi S, Mukherjee A, Lingwan M, Dar MA, Bhagavatula L, Datta S. Plant microProteins: Small but powerful modulators of plant development. iScience 2022; 25:105400. [PMID: 36353725 PMCID: PMC9638782 DOI: 10.1016/j.isci.2022.105400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
MicroProteins (miPs) are small and single-domain containing proteins of less than 20 kDa. This domain allows microProteins to interact with compatible domains of evolutionary-related proteins and fine-tuning the key physiological pathways in several organisms. Since the first report of a microProtein in mice, numerous microProteins have been identified in plants by computational approaches. However, only a few candidates have been functionally characterized, primarily in Arabidopsis. The recent success of synthetic microProteins in modulating physiological activities in crops makes these proteins interesting candidates for crop engineering. Here, we comprehensively summarise the synthesis, mode of action, and functional roles of microProteins in plants. We also discuss different approaches used to identify plant microProteins. Additionally, we discuss novel approaches to design synthetic microProteins that can be used to target proteins regulating plant growth and development. We finally highlight the prospects and challenges of utilizing microProteins in future crop improvement programs. MicroProteins (miPs) are small-sized proteins with a molecular weight of 5–20 kDa MiPs can be detected through multiomics and computational approaches MiPs are crucial regulators of plant growth and development MiPs as condensates, synthetic miPs, and limitations
Collapse
|
9
|
Translation and natural selection of micropeptides from long non-canonical RNAs. Nat Commun 2022; 13:6515. [PMID: 36316320 PMCID: PMC9622821 DOI: 10.1038/s41467-022-34094-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 10/13/2022] [Indexed: 12/25/2022] Open
Abstract
Long noncoding RNAs (lncRNAs) are transcripts longer than 200 nucleotides but lacking canonical coding sequences. Apparently unable to produce peptides, lncRNA function seems to rely only on RNA expression, sequence and structure. Here, we exhaustively detect in-vivo translation of small open reading frames (small ORFs) within lncRNAs using Ribosomal profiling during Drosophila melanogaster embryogenesis. We show that around 30% of lncRNAs contain small ORFs engaged by ribosomes, leading to regulated translation of 100 to 300 micropeptides. We identify lncRNA features that favour translation, such as cistronicity, Kozak sequences, and conservation. For the latter, we develop a bioinformatics pipeline to detect small ORF homologues, and reveal evidence of natural selection favouring the conservation of micropeptide sequence and function across evolution. Our results expand the repertoire of lncRNA biochemical functions, and suggest that lncRNAs give rise to novel coding genes throughout evolution. Since most lncRNAs contain small ORFs with as yet unknown translation potential, we propose to rename them "long non-canonical RNAs".
Collapse
|
10
|
Chothani SP, Adami E, Widjaja AA, Langley SR, Viswanathan S, Pua CJ, Zhihao NT, Harmston N, D'Agostino G, Whiffin N, Mao W, Ouyang JF, Lim WW, Lim S, Lee CQE, Grubman A, Chen J, Kovalik JP, Tryggvason K, Polo JM, Ho L, Cook SA, Rackham OJL, Schafer S. A high-resolution map of human RNA translation. Mol Cell 2022; 82:2885-2899.e8. [PMID: 35841888 DOI: 10.1016/j.molcel.2022.06.023] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Revised: 03/10/2022] [Accepted: 06/15/2022] [Indexed: 10/17/2022]
Abstract
Translated small open reading frames (smORFs) can have important regulatory roles and encode microproteins, yet their genome-wide identification has been challenging. We determined the ribosome locations across six primary human cell types and five tissues and detected 7,767 smORFs with translational profiles matching those of known proteins. The human genome was found to contain highly cell-type- and tissue-specific smORFs and a subset that encodes highly conserved amino acid sequences. Changes in the translational efficiency of upstream-encoded smORFs (uORFs) and the corresponding main ORFs predominantly occur in the same direction. Integration with 456 mass-spectrometry datasets confirms the presence of 603 small peptides at the protein level in humans and provides insights into the subcellular localization of these small proteins. This study provides a comprehensive atlas of high-confidence translated smORFs derived from primary human cells and tissues in order to provide a more complete understanding of the translated human genome.
Collapse
Affiliation(s)
- Sonia P Chothani
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Eleonora Adami
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore; Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), 13125 Berlin, Germany
| | - Anissa A Widjaja
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Sarah R Langley
- Lee Kong Chian School of Medicine, Nanyang Technological University, Clinical Sciences Building, Singapore 308232, Singapore
| | - Sivakumar Viswanathan
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Chee Jian Pua
- National Heart Research Institute Singapore (NHRIS), National Heart Centre Singapore, Singapore 169609, Singapore
| | - Nevin Tham Zhihao
- Lee Kong Chian School of Medicine, Nanyang Technological University, Clinical Sciences Building, Singapore 308232, Singapore
| | - Nathan Harmston
- Program in Cancer and Stem Cell Biology, Duke-NUS Medical School, Singapore 169857, Singapore; Science Division, Yale-NUS College, Singapore 138527, Singapore
| | - Giuseppe D'Agostino
- Lee Kong Chian School of Medicine, Nanyang Technological University, Clinical Sciences Building, Singapore 308232, Singapore
| | - Nicola Whiffin
- Wellcome Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
| | - Wang Mao
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - John F Ouyang
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Wei Wen Lim
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore; National Heart Research Institute Singapore (NHRIS), National Heart Centre Singapore, Singapore 169609, Singapore
| | - Shiqi Lim
- National Heart Research Institute Singapore (NHRIS), National Heart Centre Singapore, Singapore 169609, Singapore
| | - Cheryl Q E Lee
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Alexandra Grubman
- Department of Anatomy and Developmental Biology, Monash University, Wellington Road, Clayton, VIC 3800, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Wellington Road, Clayton, VIC 3800, Australia; Australian Regenerative Medicine Institute, Monash University, Wellington Road, Clayton, VIC 3800, Australia
| | - Joseph Chen
- Department of Anatomy and Developmental Biology, Monash University, Wellington Road, Clayton, VIC 3800, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Wellington Road, Clayton, VIC 3800, Australia; Australian Regenerative Medicine Institute, Monash University, Wellington Road, Clayton, VIC 3800, Australia
| | - J P Kovalik
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Karl Tryggvason
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Jose M Polo
- Department of Anatomy and Developmental Biology, Monash University, Wellington Road, Clayton, VIC 3800, Australia; Development and Stem Cells Program, Monash Biomedicine Discovery Institute, Wellington Road, Clayton, VIC 3800, Australia; Australian Regenerative Medicine Institute, Monash University, Wellington Road, Clayton, VIC 3800, Australia
| | - Lena Ho
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore
| | - Stuart A Cook
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore; National Heart Research Institute Singapore (NHRIS), National Heart Centre Singapore, Singapore 169609, Singapore; London Institute of Medical Sciences, London W12 ONN, UK
| | - Owen J L Rackham
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore; School of Biological Sciences, University of Southampton, Southampton, UK.
| | - Sebastian Schafer
- Program in Cardiovascular and Metabolic Disorders, Duke-National University of Singapore, Singapore 169857, Singapore; National Heart Research Institute Singapore (NHRIS), National Heart Centre Singapore, Singapore 169609, Singapore.
| |
Collapse
|
11
|
Han T, Cong H, Yu B, Shen Y. Application of peptide biomarkers in life analysis based on liquid chromatography-mass spectrometry technology. Biofactors 2022; 48:725-743. [PMID: 35816279 DOI: 10.1002/biof.1875] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 06/18/2022] [Indexed: 12/11/2022]
Abstract
Biomedicine is developing rapidly in the 21st century. Among them, the qualitative and quantitative analysis of peptide biomarkers is of considerable importance for the diagnosis and therapy of diseases and the quality evaluation of drugs and food. The identification and quantitative analysis of peptides have been going on for decades. Traditionally, immunoassays or biological assays are generally used to quantify peptides in biological matrices. However, the selectivity and sensitivity of these methods cannot meet the requirements of the application. The separation and analysis technique of liquid chromatography-mass spectrometry (LC-MS) supplies a reliable alternative. In contrast to immunoassays, LC-MS methods are capable of providing the analytical prowess necessary to satisfy the demands of peptide biomarker research in the life sciences arena. This review article provides a historical account of the in-roads made by LC-MS technology for the detection of peptide biomarkers in the past 10 years, with the focus on the qualification/quantification developments and their applications.
Collapse
Affiliation(s)
- Tingting Han
- Institute of Biomedical Materials and Engineering, College of Chemistry and Chemical Engineering, College of Materials Science and Engineering, Qingdao University, Qingdao, China
| | - Hailin Cong
- Institute of Biomedical Materials and Engineering, College of Chemistry and Chemical Engineering, College of Materials Science and Engineering, Qingdao University, Qingdao, China
- State Key Laboratory of Bio-Fibers and Eco-Textiles, Qingdao University, Qingdao, China
| | - Bing Yu
- Institute of Biomedical Materials and Engineering, College of Chemistry and Chemical Engineering, College of Materials Science and Engineering, Qingdao University, Qingdao, China
- State Key Laboratory of Bio-Fibers and Eco-Textiles, Qingdao University, Qingdao, China
| | - Youqing Shen
- Institute of Biomedical Materials and Engineering, College of Chemistry and Chemical Engineering, College of Materials Science and Engineering, Qingdao University, Qingdao, China
- Center for Bionanoengineering and Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China
| |
Collapse
|
12
|
Pan J, Wang R, Shang F, Ma R, Rong Y, Zhang Y. Functional Micropeptides Encoded by Long Non-Coding RNAs: A Comprehensive Review. Front Mol Biosci 2022; 9:817517. [PMID: 35769907 PMCID: PMC9234465 DOI: 10.3389/fmolb.2022.817517] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 05/24/2022] [Indexed: 12/03/2022] Open
Abstract
Long non-coding RNAs (lncRNAs) were originally defined as non-coding RNAs (ncRNAs) which lack protein-coding ability. However, with the emergence of technologies such as ribosome profiling sequencing and ribosome-nascent chain complex sequencing, it has been demonstrated that most lncRNAs have short open reading frames hence the potential to encode functional micropeptides. Such micropeptides have been described to be widely involved in life-sustaining activities in several organisms, such as homeostasis regulation, disease, and tumor occurrence, and development, and morphological development of animals, and plants. In this review, we focus on the latest developments in the field of lncRNA-encoded micropeptides, and describe the relevant computational tools and techniques for micropeptide prediction and identification. This review aims to serve as a reference for future research studies on lncRNA-encoded micropeptides.
Collapse
Affiliation(s)
- Jianfeng Pan
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Ruijun Wang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- Key Laboratory of Mutton Sheep Genetics and Breeding, Ministry of Agriculture, Hohhot, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Hohhot, China
- Engineering Research Center for Goat Genetics and Breeding, Hohhot, China
| | - Fangzheng Shang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Rong Ma
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Youjun Rong
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
| | - Yanjun Zhang
- College of Animal Science, Inner Mongolia Agricultural University, Hohhot, China
- Key Laboratory of Mutton Sheep Genetics and Breeding, Ministry of Agriculture, Hohhot, China
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Hohhot, China
- Engineering Research Center for Goat Genetics and Breeding, Hohhot, China
- *Correspondence: Yanjun Zhang,
| |
Collapse
|
13
|
Cancer-related micropeptides encoded by ncRNAs: Promising drug targets and prognostic biomarkers. Cancer Lett 2022; 547:215723. [DOI: 10.1016/j.canlet.2022.215723] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 04/14/2022] [Accepted: 05/01/2022] [Indexed: 02/07/2023]
|
14
|
Yuanyuan J, Xinqiang Y. Micropeptides Identified from Human Genomes. J Proteome Res 2022; 21:865-873. [DOI: 10.1021/acs.jproteome.1c00889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jing Yuanyuan
- School of Public Health, North Sichuan Medical College, Nanchong 637000, China
| | - Yin Xinqiang
- School of Basic Medicine and Forensics, North Sichuan Medical College, Nanchong 637000, China
| |
Collapse
|
15
|
Li Y, Zhang Y, Pan T, Zhou P, Zhou W, Gao Y, Zheng S, Xu J. Shedding light on the hidden human proteome expands immunopeptidome in cancer. Brief Bioinform 2022; 23:6533503. [PMID: 35189633 DOI: 10.1093/bib/bbac034] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 01/07/2022] [Accepted: 01/25/2022] [Indexed: 01/04/2023] Open
Abstract
Unrestrained cellular growth and immune escape of a tumor are associated with the incidental errors of the genome and transcriptome. Advances in next-generation sequencing have identified thousands of genomic and transcriptomic aberrations that generate variant peptides that assemble the hidden proteome, further expanding the immunopeptidome. Emerging next-generation sequencing technologies and a number of computational methods estimated the abundance of immune infiltration from bulk transcriptome have advanced our understanding of tumor microenvironments. Here, we will characterize several major types of tumor-specific antigens arising from single-nucleotide variants, insertions and deletions, gene fusion, alternative splicing, RNA editing and non-coding RNAs. Finally, we summarize the current state-of-the-art computational and experimental approaches or resources and provide an integrative pipeline for the identification of candidate tumor antigens. Together, the systematic investigation of the hidden proteome in cancer will help facilitate the development of effective and durable immunotherapy targets for cancer.
Collapse
Affiliation(s)
- Yongsheng Li
- College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Yunpeng Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Tao Pan
- College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Ping Zhou
- Department of Radiotherapy, the First Affiliated Hospital of Hainan Medical University, Hainan, China
| | - Weiwei Zhou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| | - Yueying Gao
- College of Biomedical Information and Engineering, Hainan Women and Children's Medical Center, Hainan Medical University, Haikou 571199, China
| | - Shaojiang Zheng
- Key Laboratory of Emergency and Trauma of Ministry of Education, Tumor Institute of the First Affiliated Hospital, Hainan Medical University, Haikou, 571199, China
| | - Juan Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China
| |
Collapse
|
16
|
Cardon T, Fournier I, Salzet M. Unveiling a Ghost Proteome in the Glioblastoma Non-Coding RNAs. Front Cell Dev Biol 2022; 9:703583. [PMID: 35004666 PMCID: PMC8733697 DOI: 10.3389/fcell.2021.703583] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Accepted: 12/03/2021] [Indexed: 12/13/2022] Open
Abstract
Glioblastoma is the most common brain cancer in adults. Nevertheless, the median survival time is 15 months, if treated with at least a near total resection and followed by radiotherapy in association with temozolomide. In glioblastoma (GBM), variations of non-coding ribonucleic acid (ncRNA) expression have been demonstrated in tumor processes, especially in the regulation of major signaling pathways. Moreover, many ncRNAs present in their sequences an Open Reading Frame (ORF) allowing their translations into proteins, so-called alternative proteins (AltProt) and constituting the “ghost proteome.” This neglected world in GBM has been shown to be implicated in protein–protein interaction (PPI) with reference proteins (RefProt) reflecting involvement in signaling pathways linked to cellular mobility and transfer RNA regulation. More recently, clinical studies have revealed that AltProt is also involved in the patient’s survival and bad prognosis. We thus propose to review the ncRNAs involved in GBM and highlight their function in the disease.
Collapse
Affiliation(s)
- Tristan Cardon
- University of Lille, Inserm, CHU Lille, U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM, Lille, France
| | - Isabelle Fournier
- University of Lille, Inserm, CHU Lille, U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM, Lille, France.,Institut Universitaire de France, Paris, France
| | - Michel Salzet
- University of Lille, Inserm, CHU Lille, U1192-Protéomique Réponse Inflammatoire Spectrométrie de Masse-PRISM, Lille, France.,Institut Universitaire de France, Paris, France
| |
Collapse
|
17
|
Andreev DE, Baranov PV, Milogorodskii A, Rachinskii D. A deterministic model for non-monotone relationship between translation of upstream and downstream open reading frames. MATHEMATICAL MEDICINE AND BIOLOGY : A JOURNAL OF THE IMA 2021; 38:490-515. [PMID: 34718568 DOI: 10.1093/imammb/dqab015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 08/12/2021] [Accepted: 10/06/2021] [Indexed: 01/01/2023]
Abstract
Totally asymmetric simple exclusion process (TASEP) modelling was shown to offer a parsimonious explanation for the experimentally confirmed ability of a single upstream open reading frames (uORFs) to upregulate downstream translation during the integrated stress response. As revealed by numerical simulations, the model predicts that reducing the density of scanning ribosomes upstream of certain uORFs increases the flow of ribosomes downstream. To gain a better insight into the mechanism which ensures the non-monotone relation between the upstream and downstream flows, in this work, we propose a phenomenological deterministic model approximating the TASEP model of the translation process. We establish the existence of a stationary solution featuring the decreasing density along the uORF for the deterministic model. Further, we find an explicit non-monotone relation between the upstream ribosome density and the downstream flow for the stationary solution in the limit of increasing uORF length and increasingly leaky initiation. The stationary distribution of the TASEP model, the stationary solution of the deterministic model and the explicit limit are compared numerically.
Collapse
Affiliation(s)
- D E Andreev
- Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow, 119991, Russian Federation, and Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
| | - P V Baranov
- University College Cork, College Road, Cork, T12 K8AF, Ireland, and Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry (RAS), 16/10 Miklukho-Maklay str., Moscow, 117997, Russian Federation
| | - A Milogorodskii
- Lomonosov Moscow State University, GSP-1, Leninskie Gory, Moscow, 119991, Russian Federation, and Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, RAS, Moscow, Russia
| | - D Rachinskii
- Department of Mathematical Sciences, The University of Texas at Dallas, 800 W. Campbell Rd, Richardson, TX 75080, USA
| |
Collapse
|
18
|
Chatterjee O, Gopalakrishnan L, Mol P, Advani J, Nair B, Shankar SK, Mahadevan A, Prasad TSK. The Normal Human Adult Hypothalamus Proteomic Landscape: Rise of Neuroproteomics in Biological Psychiatry and Systems Biology. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2021; 25:693-710. [PMID: 34714154 DOI: 10.1089/omi.2021.0158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The human hypothalamus is central to the regulation of neuroendocrine and neurovegetative systems, as well as modulation of chronobiology and behavioral aspects in human health and disease. Surprisingly, a deep proteomic analysis of the normal human hypothalamic proteome has been missing for such an important organ so far. In this study, we delineated the human hypothalamus proteome using a high-resolution mass spectrometry approach which resulted in the identification of 5349 proteins, while a multiple post-translational modification (PTM) search identified 191 additional proteins, which were missed in the first search. A proteogenomic analysis resulted in the discovery of multiple novel protein-coding regions as we identified proteins from noncoding regions (pseudogenes) and proteins translated from short open reading frames that can be missed using the traditional pipeline of prediction of protein-coding genes as a part of genome annotation. We also identified several PTMs of hypothalamic proteins that may be required for normal hypothalamic functions. Moreover, we observed an enrichment of proteins pertaining to autophagy and adult neurogenesis in the proteome data. We believe that the hypothalamic proteome reported herein would help to decipher the molecular basis for the diverse range of physiological functions attributed to it, as well as its role in neurological and psychiatric diseases. Extensive proteomic profiling of the hypothalamic nuclei would further elaborate on the role and functional characterization of several hypothalamus-specific proteins and pathways to inform future research and clinical discoveries in biological psychiatry, neurology, and system biology.
Collapse
Affiliation(s)
- Oishi Chatterjee
- Institute of Bioinformatics, Bangalore India.,Amrita School of Biotechnology, Amrita University, Kollam, India.,Center for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India
| | - Lathika Gopalakrishnan
- Institute of Bioinformatics, Bangalore India.,Center for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India.,Manipal Academy of Higher Education, Manipal, India
| | - Praseeda Mol
- Institute of Bioinformatics, Bangalore India.,Amrita School of Biotechnology, Amrita University, Kollam, India
| | | | - Bipin Nair
- Amrita School of Biotechnology, Amrita University, Kollam, India
| | - Susarla Krishna Shankar
- Department of Neuropathology, National Institute of Mental Health and Neurosciences, Bangalore, India.,Human Brain Tissue Repository, National Institute of Mental Health and Neurosciences, Bangalore, India
| | - Anita Mahadevan
- Department of Neuropathology, National Institute of Mental Health and Neurosciences, Bangalore, India.,Human Brain Tissue Repository, National Institute of Mental Health and Neurosciences, Bangalore, India
| | | |
Collapse
|
19
|
Sajjad N, Wang S, Liu P, Chen JL, Chi X, Liu S, Ma S. Functional Roles of Non-coding RNAs in the Interaction Between Host and Influenza A Virus. Front Microbiol 2021; 12:742984. [PMID: 34745043 PMCID: PMC8569443 DOI: 10.3389/fmicb.2021.742984] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Accepted: 10/04/2021] [Indexed: 11/13/2022] Open
Abstract
Non-coding RNAs (ncRNAs) are extensively expressed in various cells and tissues, and studies have shown that ncRNAs play significant roles in cell regulation. However, in the past few decades, the knowledge of ncRNAs has been increased dramatically due to their transcriptional ability and multiple regulatory functions. Typically, regulatory ncRNAs include long ncRNAs (lncRNAs), miRNAs, piRNAs, Y RNAs, vault RNAs, and circular RNAs (circRNAs), etc. Previous studies have revealed that various ncRNAs are involved in the host responses to virus infection and play critical roles in the regulation of host-virus interactions. In this review, we discuss the conceptual framework and biological regulations of ncRNAs to elucidate their functions in response to viral infection, especially influenza A virus (IAV) infection. In addition, we summarize the ncRNAs that are associated with innate immunity and involvement of interferons and their stimulated genes (ISGs) during IAV infection.
Collapse
Affiliation(s)
- Nelam Sajjad
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Song Wang
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Ping Liu
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Ji-Long Chen
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
- CAS Key Laboratory of Pathogenic Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Xiaojuan Chi
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shasha Liu
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shujie Ma
- Key Laboratory of Fujian-Taiwan Animal Pathogen Biology, College of Animal Sciences, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
20
|
Dib A, Zanet J, Mancheno-Ferris A, Gallois M, Markus D, Valenti P, Marques-Prieto S, Plaza S, Kageyama Y, Chanut-Delalande H, Payre F. Pri smORF Peptides Are Wide Mediators of Ecdysone Signaling, Contributing to Shape Spatiotemporal Responses. Front Genet 2021; 12:714152. [PMID: 34527021 DOI: 10.3389/fgene.2021.714152] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Accepted: 07/28/2021] [Indexed: 11/13/2022] Open
Abstract
There is growing evidence that peptides encoded by small open-reading frames (sORF or smORF) can fulfill various cellular functions and define a novel class regulatory molecules. To which extend transcripts encoding only smORF peptides compare with canonical protein-coding genes, yet remain poorly understood. In particular, little is known on whether and how smORF-encoding RNAs might need tightly regulated expression within a given tissue, at a given time during development. We addressed these questions through the analysis of Drosophila polished rice (pri, a.k.a. tarsal less or mille pattes), which encodes four smORF peptides (11-32 amino acids in length) required at several stages of development. Previous work has shown that the expression of pri during epidermal development is regulated in the response to ecdysone, the major steroid hormone in insects. Here, we show that pri transcription is strongly upregulated by ecdysone across a large panel of cell types, suggesting that pri is a core component of ecdysone response. Although pri is produced as an intron-less short transcript (1.5 kb), genetic assays reveal that the developmental functions of pri require an unexpectedly large array of enhancers (spanning over 50 kb), driving a variety of spatiotemporal patterns of pri expression across developing tissues. Furthermore, we found that separate pri enhancers are directly activated by the ecdysone nuclear receptor (EcR) and display distinct regulatory modes between developmental tissues and/or stages. Alike major developmental genes, the expression of pri in a given tissue often involves several enhancers driving apparently redundant (or shadow) expression, while individual pri enhancers can harbor pleiotropic functions across tissues. Taken together, these data reveal the broad role of Pri smORF peptides in ecdysone signaling and show that the cis-regulatory architecture of the pri gene contributes to shape distinct spatial and temporal patterns of ecdysone response throughout development.
Collapse
Affiliation(s)
- Azza Dib
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Jennifer Zanet
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Alexandra Mancheno-Ferris
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Maylis Gallois
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Damien Markus
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Philippe Valenti
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Simon Marques-Prieto
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Serge Plaza
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - Yuji Kageyama
- Department of Biology, Graduate School of Science, Kobe University, Kobe, Japan.,Biosignal Research Center, Kobe University, Kobe, Japan
| | - Hélène Chanut-Delalande
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| | - François Payre
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), CNRS, UPS, University of Toulouse, Toulouse, France
| |
Collapse
|
21
|
Peeters MKR, Baggerman G, Gabriels R, Pepermans E, Menschaert G, Boonen K. Ion Mobility Coupled to a Time-of-Flight Mass Analyzer Combined With Fragment Intensity Predictions Improves Identification of Classical Bioactive Peptides and Small Open Reading Frame-Encoded Peptides. Front Cell Dev Biol 2021; 9:720570. [PMID: 34604223 PMCID: PMC8484717 DOI: 10.3389/fcell.2021.720570] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 08/25/2021] [Indexed: 12/29/2022] Open
Abstract
Bioactive peptides exhibit key roles in a wide variety of complex processes, such as regulation of body weight, learning, aging, and innate immune response. Next to the classical bioactive peptides, emerging from larger precursor proteins by specific proteolytic processing, a new class of peptides originating from small open reading frames (sORFs) have been recognized as important biological regulators. But their intrinsic properties, specific expression pattern and location on presumed non-coding regions have hindered the full characterization of the repertoire of bioactive peptides, despite their predominant role in various pathways. Although the development of peptidomics has offered the opportunity to study these peptides in vivo, it remains challenging to identify the full peptidome as the lack of cleavage enzyme specification and large search space complicates conventional database search approaches. In this study, we introduce a proteogenomics methodology using a new type of mass spectrometry instrument and the implementation of machine learning tools toward improved identification of potential bioactive peptides in the mouse brain. The application of trapped ion mobility spectrometry (tims) coupled to a time-of-flight mass analyzer (TOF) offers improved sensitivity, an enhanced peptide coverage, reduction in chemical noise and the reduced occurrence of chimeric spectra. Subsequent machine learning tools MS2PIP, predicting fragment ion intensities and DeepLC, predicting retention times, improve the database searching based on a large and comprehensive custom database containing both sORFs and alternative ORFs. Finally, the identification of peptides is further enhanced by applying the post-processing semi-supervised learning tool Percolator. Applying this workflow, the first peptidomics workflow combined with spectral intensity and retention time predictions, we identified a total of 167 predicted sORF-encoded peptides, of which 48 originating from presumed non-coding locations, next to 401 peptides from known neuropeptide precursors, linked to 66 annotated bioactive neuropeptides from within 22 different families. Additional PEAKS analysis expanded the pool of SEPs on presumed non-coding locations to 84, while an additional 204 peptides completed the list of peptides from neuropeptide precursors. Altogether, this study provides insights into a new robust pipeline that fuses technological advancements from different fields ensuring an improved coverage of the neuropeptidome in the mouse brain.
Collapse
Affiliation(s)
- Marlies K. R. Peeters
- BioBix, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
| | - Geert Baggerman
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Unit Environmental Risk and Health, Flemish Institute for Technological Research, Mol, Belgium
| | - Ralf Gabriels
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- VIB-UGent Center for Medical Biotechnology, Flanders Institute for Biotechnology, Ghent, Belgium
| | - Elise Pepermans
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Unit Environmental Risk and Health, Flemish Institute for Technological Research, Mol, Belgium
| | - Gerben Menschaert
- BioBix, Department of Data Analysis and Mathematical Modelling, Ghent University, Ghent, Belgium
- OHMX.bio, Ghent, Belgium
| | - Kurt Boonen
- Centre for Proteomics, University of Antwerp, Antwerp, Belgium
- Unit Environmental Risk and Health, Flemish Institute for Technological Research, Mol, Belgium
| |
Collapse
|
22
|
Abstract
Transcriptional regulators that integrate cellular and environmental signals to control cell division are well known in bacteria and eukaryotes, but their existence is poorly understood in archaea. We identified a conserved gene (cdrS) that encodes a small protein and is highly transcribed in the model archaeon Haloferax volcanii. The cdrS gene could not be deleted, but CRISPR interference (CRISPRi)-mediated repression of the cdrS gene caused slow growth and cell division defects and changed the expression of multiple genes and their products associated with cell division, protein degradation, and metabolism. Consistent with this complex regulatory network, overexpression of cdrS inhibited cell division, whereas overexpression of the operon encoding both CdrS and a tubulin-like cell division protein (FtsZ2) stimulated division. Chromatin immunoprecipitation-DNA sequencing (ChIP-Seq) identified 18 DNA-binding sites of the CdrS protein, including one upstream of the promoter for a cell division gene, ftsZ1, and another upstream of the essential gene dacZ, encoding diadenylate cyclase involved in c-di-AMP signaling, which is implicated in the regulation of cell division. These findings suggest that CdrS is a transcription factor that plays a central role in a regulatory network coordinating metabolism and cell division. IMPORTANCE Cell division is a central mechanism of life and is essential for growth and development. Members of the Bacteria and Eukarya have different mechanisms for cell division, which have been studied in detail. In contrast, cell division in members of the Archaea is still understudied, and its regulation is poorly understood. Interestingly, different cell division machineries appear in members of the Archaea, with the Euryarchaeota using a cell division apparatus based on the tubulin-like cytoskeletal protein FtsZ, as in bacteria. Here, we identify the small protein CdrS as essential for survival and a central regulator of cell division in the euryarchaeon Haloferax volcanii. CdrS also appears to coordinate other cellular pathways, including synthesis of signaling molecules and protein degradation. Our results show that CdrS plays a sophisticated role in cell division, including regulation of numerous associated genes. These findings are expected to initiate investigations into conditional regulation of division in archaea.
Collapse
|
23
|
Ranjbar R, Behjatfar M, Teimouri A, Aghaie Fard A, Maniati M, Taheri-Anganeh M. Long non-coding RNAs and microorganism-associated cancers. Cell Biochem Funct 2021; 39:844-853. [PMID: 34227160 DOI: 10.1002/cbf.3657] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Revised: 06/17/2021] [Accepted: 06/25/2021] [Indexed: 12/26/2022]
Abstract
Cancerous cells are abnormal cells characterized by aberrant growth and proliferation, which can involve various types of cells and tissues. Through numerous signalling pathways, many mechanisms are involved in cells that keep them normal. These signalling pathways are tightly set by different proteins whose expression is regulated by a large number of factors. In other words, when a regulating factor does not act properly or undergoes a change in its function or expression, the result will be that the subordinate gene and subsequently the related protein will show deranged expression and activity. This leads to disordered signalling pathways which bring about uncontrolled proliferation in cells. One of the most significant factors in adjusting the expression of genes is noncoding RNAs. It should be noted that all underlying causes initiating malignancy try to alter the main regulatory factors in cellular processes and gene expression and direct the cell to an unregulated state. Microorganisms have been identified as one of the important elements to direct normal cells to abnormality. That is, they probably agitate the malignant traits through manipulating significant factors such as ncRNAs in given cells using their own or host-related factors. The present study is aimed at examining how the long noncoding RNAs are involved in microorganism-mediated cancers.
Collapse
Affiliation(s)
- Reza Ranjbar
- Molecular Biology Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - Mojtaba Behjatfar
- Department of Electrical Engineering, Kazeroon Branch, Islamic Azad University, Kazeroon, Iran
| | - Ali Teimouri
- Student Research Committee, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Arad Aghaie Fard
- Department of Hematology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Mahmood Maniati
- English Department, School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Mortaza Taheri-Anganeh
- Department of Medical Biotechnology, School of Advanced Medical Sciences and Technologies, Shiraz University of Medical Sciences, Shiraz, Iran
| |
Collapse
|
24
|
Choteau SA, Wagner A, Pierre P, Spinelli L, Brun C. MetamORF: a repository of unique short open reading frames identified by both experimental and computational approaches for gene and metagene analyses. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6307706. [PMID: 34156446 PMCID: PMC8218702 DOI: 10.1093/database/baab032] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 04/08/2021] [Accepted: 05/17/2021] [Indexed: 11/12/2022]
Abstract
The development of high-throughput technologies revealed the existence of non-canonical short open reading frames (sORFs) on most eukaryotic ribonucleic acids. They are ubiquitous genetic elements conserved across species and suspected to be involved in numerous cellular processes. MetamORF (https://metamorf.hb.univ-amu.fr/) aims to provide a repository of unique sORFs identified in the human and mouse genomes with both experimental and computational approaches. By gathering publicly available sORF data, normalizing them and summarizing redundant information, we were able to identify a total of 1 162 675 unique sORFs. Despite the usual characterization of ORFs as short, upstream or downstream, there is currently no clear consensus regarding the definition of these categories. Thus, the data have been reprocessed using a normalized nomenclature. MetamORF enables new analyses at locus, gene, transcript and ORF levels, which should offer the possibility to address new questions regarding sORF functions in the future. The repository is available through an user-friendly web interface, allowing easy browsing, visualization, filtering over multiple criteria and export possibilities. sORFs can be searched starting from a gene, a transcript and an ORF ID, looking in a genome area or browsing the whole repository for a species. The database content has also been made available through track hubs at UCSC Genome Browser. Finally, we demonstrated an enrichment of genes harboring upstream ORFs among genes expressed in response to reticular stress. Database URL https://metamorf.hb.univ-amu.fr/.
Collapse
Affiliation(s)
- Sebastien A Choteau
- Aix-Marseille University, INSERM, TAGC, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France.,Aix-Marseille University, INSERM, CNRS, CIML, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France
| | - Audrey Wagner
- Aix-Marseille University, INSERM, TAGC, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France
| | - Philippe Pierre
- Aix-Marseille University, INSERM, CNRS, CIML, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France.,Department of Medical Sciences, Institute for Research in Biomedicine (iBiMED) and Ilidio Pinho Foundation, University of Aveiro, Aveiro 3810-193, Portugal.,Shanghai Institute of Immunology, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Lionel Spinelli
- Aix-Marseille University, INSERM, TAGC, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France.,Aix-Marseille University, INSERM, CNRS, CIML, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France
| | - Christine Brun
- Aix-Marseille University, INSERM, TAGC, Turing Centre for Living Systems, 163 Avenue de Luminy, Marseille 13009, France.,CNRS, 31 Chemin Joseph Aiguier, Marseille 13009, France
| |
Collapse
|
25
|
Zhang Q, Wu E, Tang Y, Cai T, Zhang L, Wang J, Hao Y, Zhang B, Zhou Y, Guo X, Luo J, Chen R, Yang F. Deeply Mining a Universe of Peptides Encoded by Long Noncoding RNAs. Mol Cell Proteomics 2021; 20:100109. [PMID: 34129944 PMCID: PMC8335655 DOI: 10.1016/j.mcpro.2021.100109] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 05/16/2021] [Accepted: 06/02/2021] [Indexed: 11/22/2022] Open
Abstract
Many small ORFs embedded in long noncoding RNA (lncRNA) transcripts have been shown to encode biologically functional polypeptides (small ORF-encoded polypeptides [SEPs]) in different organisms. Despite some novel SEPs have been found, the identification is still hampered by their poor predictability, diminutive size, and low relative abundance. Here, we take advantage of NONCODE, a repository containing the most complete collection and annotation of lncRNA transcripts from different species, to build a novel database that attempts to maximize a collection of SEPs from human and mouse lncRNA transcripts. In order to further improve SEP discovery, we implemented two effective and complementary polypeptide enrichment strategies using 30-kDa molecular weight cutoff filter and C8 solid-phase extraction column. These combined strategies enabled us to discover 353 SEPs from eight human cell lines and 409 SEPs from three mouse cell lines and eight mouse tissues. Importantly, 19 of them were then verified through in vitro expression, immunoblotting, parallel reaction monitoring, and synthetic peptides. Subsequent bioinformatics analysis revealed that some of the physical and chemical properties of these novel SEPs, including amino acid composition and codon usage, are different from those commonly found in canonical proteins. Intriguingly, nearly 65% of the identified SEPs were found to be initiated with non-AUG start codons. The 762 novel SEPs probably represent the largest number of SEPs detected by MS reported to date. These novel SEPs might not only provide new clues for the annotation of noncoding elements in the genome but also serve as a valuable resource for functional study.
Collapse
Affiliation(s)
- Qing Zhang
- Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Erzhong Wu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yiheng Tang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Tanxi Cai
- Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Lili Zhang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Jifeng Wang
- Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Yajing Hao
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Bao Zhang
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yue Zhou
- Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Thermofisher Scientific, Shanghai, China
| | - Xiaojing Guo
- Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Jianjun Luo
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.
| | - Runsheng Chen
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; Guangdong Geneway Decoding Bio-Tech Co Ltd, Foshan, China.
| | - Fuquan Yang
- Laboratory of Protein and Peptide Pharmaceuticals & Laboratory of Proteomics, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
26
|
Immarigeon C, Frei Y, Delbare SYN, Gligorov D, Machado Almeida P, Grey J, Fabbro L, Nagoshi E, Billeter JC, Wolfner MF, Karch F, Maeda RK. Identification of a micropeptide and multiple secondary cell genes that modulate Drosophila male reproductive success. Proc Natl Acad Sci U S A 2021; 118:e2001897118. [PMID: 33876742 PMCID: PMC8053986 DOI: 10.1073/pnas.2001897118] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Even in well-characterized genomes, many transcripts are considered noncoding RNAs (ncRNAs) simply due to the absence of large open reading frames (ORFs). However, it is now becoming clear that many small ORFs (smORFs) produce peptides with important biological functions. In the process of characterizing the ribosome-bound transcriptome of an important cell type of the seminal fluid-producing accessory gland of Drosophila melanogaster, we detected an RNA, previously thought to be noncoding, called male-specific abdominal (msa). Notably, msa is nested in the HOX gene cluster of the Bithorax complex and is known to contain a micro-RNA within one of its introns. We find that this RNA encodes a "micropeptide" (9 or 20 amino acids, MSAmiP) that is expressed exclusively in the secondary cells of the male accessory gland, where it seems to accumulate in nuclei. Importantly, loss of function of this micropeptide causes defects in sperm competition. In addition to bringing insights into the biology of a rare cell type, this work underlines the importance of small peptides, a class of molecules that is now emerging as important actors in complex biological processes.
Collapse
Affiliation(s)
- Clément Immarigeon
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland;
| | - Yohan Frei
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland
| | - Sofie Y N Delbare
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853-2703
| | - Dragan Gligorov
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland
| | - Pedro Machado Almeida
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland
| | - Jasmine Grey
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853-2703
| | - Léa Fabbro
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland
| | - Emi Nagoshi
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland
| | - Jean-Christophe Billeter
- Groningen Institute for Evolutionary Life Sciences, University of Groningen, Groningen 9700 CC, The Netherlands
| | - Mariana F Wolfner
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853-2703
| | - François Karch
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland
| | - Robert K Maeda
- Department of Genetics and Evolution, Sciences III, University of Geneva, 1211 Geneva 4, Switzerland;
| |
Collapse
|
27
|
Vitorino R, Guedes S, Amado F, Santos M, Akimitsu N. The role of micropeptides in biology. Cell Mol Life Sci 2021; 78:3285-3298. [PMID: 33507325 PMCID: PMC11073438 DOI: 10.1007/s00018-020-03740-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Revised: 12/01/2020] [Accepted: 12/11/2020] [Indexed: 12/11/2022]
Abstract
Micropeptides are small polypeptides coded by small open-reading frames. Progress in computational biology and the analyses of large-scale transcriptomes and proteomes have revealed that mammalian genomes produce a large number of transcripts encoding micropeptides. Many of these have been previously annotated as long noncoding RNAs. The role of micropeptides in cellular homeostasis maintenance has been demonstrated. This review discusses different types of micropeptides as well as methods to identify them, such as computational approaches, ribosome profiling, and mass spectrometry.
Collapse
Affiliation(s)
- Rui Vitorino
- Departamento de Cirurgia E Fisiologia, Faculdade de Medicina da Universidade Do Porto, UnIC, Porto, Portugal.
- Department of Medical Sciences, iBiMED, University of Aveiro, Aveiro, Portugal.
| | - Sofia Guedes
- Departamento de Química, LAQV-REQUIMTE, Universidade de Aveiro, Aveiro, Portugal
- Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Francisco Amado
- Departamento de Química, LAQV-REQUIMTE, Universidade de Aveiro, Aveiro, Portugal
- Department of Chemistry, University of Aveiro, Aveiro, Portugal
| | - Manuel Santos
- Department of Medical Sciences, iBiMED, University of Aveiro, Aveiro, Portugal
| | | |
Collapse
|
28
|
Steinberg R, Koch HG. The largely unexplored biology of small proteins in pro- and eukaryotes. FEBS J 2021; 288:7002-7024. [PMID: 33780127 DOI: 10.1111/febs.15845] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/11/2021] [Accepted: 03/26/2021] [Indexed: 12/29/2022]
Abstract
The large abundance of small open reading frames (smORFs) in prokaryotic and eukaryotic genomes and the plethora of smORF-encoded small proteins became only apparent with the constant advancements in bioinformatic, genomic, proteomic, and biochemical tools. Small proteins are typically defined as proteins of < 50 amino acids in prokaryotes and of less than 100 amino acids in eukaryotes, and their importance for cell physiology and cellular adaptation is only beginning to emerge. In contrast to antimicrobial peptides, which are secreted by prokaryotic and eukaryotic cells for combatting pathogens and competitors, small proteins act within the producing cell mainly by stabilizing protein assemblies and by modifying the activity of larger proteins. Production of small proteins is frequently linked to stress conditions or environmental changes, and therefore, cells seem to use small proteins as intracellular modifiers for adjusting cell metabolism to different intra- and extracellular cues. However, the size of small proteins imposes a major challenge for the cellular machinery required for protein folding and intracellular trafficking and recent data indicate that small proteins can engage distinct trafficking pathways. In the current review, we describe the diversity of small proteins in prokaryotes and eukaryotes, highlight distinct and common features, and illustrate how they are handled by the protein trafficking machineries in prokaryotic and eukaryotic cells. Finally, we also discuss future topics of research on this fascinating but largely unexplored group of proteins.
Collapse
Affiliation(s)
- Ruth Steinberg
- Institute for Biochemistry and Molecular Biology, Zentrum für Biochemie und Molekulare Medizin (ZMBZ), Faculty of Medicine, Albert-Ludwigs-Universität Freiburg, Germany
| | - Hans-Georg Koch
- Institute for Biochemistry and Molecular Biology, Zentrum für Biochemie und Molekulare Medizin (ZMBZ), Faculty of Medicine, Albert-Ludwigs-Universität Freiburg, Germany
| |
Collapse
|
29
|
Nasir MA, Nawaz S, Huang J. A Mini-review of Computational Approaches to Predict Functions and Findings of Novel Micro Peptides. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200811130522] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
:
New techniques in bioinformatics and the study of the transcriptome at a wide-scale
have uncovered the fact that a large part of the genome is being translated than recently perceived
thoughts and research, bringing about the creation of a various quantity of RNA with proteincoding
and noncoding potential. A lot of RNA particles have been considered as noncoding due to
many reasons, according to developing proofs. Like many sORFs that encode many functional
micro peptides have neglected due to their tiny sizes.
:
Advanced studies reveal many major biological functions of these sORFs and their encoded micro
peptides in a different and wide range of species. All the achievement in the identification of these
sORFs and micro peptides is due to the progressive bioinformatics and high-throughput
sequencing methods. This field has pulled in more consideration due to the detection of a large
number of more sORFs and micro peptides. Nowadays, COVID-19 grabs all the attention of
science as it is a sudden outbreak. sORFs of COVID-19 should be revealed for new ways to
understand this virus. This review discusses ongoing progress in the systems for the identification
and distinguishing proof of sORFs and micro peptides.
Collapse
Affiliation(s)
- Mohsin Ali Nasir
- Center for Informational Biology, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu 611731, China
| | - Samia Nawaz
- Center for Informational Biology, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu 611731, China
| | - Jian Huang
- Center for Informational Biology, University of Electronic Science and Technology of China, No. 2006, Xiyuan Ave, West Hi-Tech Zone, Chengdu 611731, China
| |
Collapse
|
30
|
Decio P, Ustaoglu P, Derecka K, Hardy ICW, Roat TC, Malaspina O, Mongan N, Stöger R, Soller M. Thiamethoxam exposure deregulates short ORF gene expression in the honey bee and compromises immune response to bacteria. Sci Rep 2021; 11:1489. [PMID: 33452318 PMCID: PMC7811001 DOI: 10.1038/s41598-020-80620-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Accepted: 12/23/2020] [Indexed: 01/29/2023] Open
Abstract
Maximizing crop yields relies on the use of agrochemicals to control insect pests. One of the most widely used classes of insecticides are neonicotinoids that interfere with signalling of the neurotransmitter acetylcholine, but these can also disrupt crop-pollination services provided by bees. Here, we analysed whether chronic low dose long-term exposure to the neonicotinoid thiamethoxam alters gene expression and alternative splicing in brains of Africanized honey bees, Apis mellifera, as adaptation to altered neuronal signalling. We find differentially regulated genes that show concentration-dependent responses to thiamethoxam, but no changes in alternative splicing. Most differentially expressed genes have no annotated function but encode short Open Reading Frames, a characteristic feature of anti-microbial peptides. As this suggested that immune responses may be compromised by thiamethoxam exposure, we tested the impact of thiamethoxam on bee immunity by injecting bacteria. We show that intrinsically sub-lethal thiamethoxam exposure makes bees more vulnerable to normally non-pathogenic bacteria. Our findings imply a synergistic mechanism for the observed bee population declines that concern agriculturists, conservation ecologists and the public.
Collapse
Affiliation(s)
- Pâmela Decio
- grid.410543.70000 0001 2188 478XInstitute of Biosciences, São Paulo State University (Unesp), Rio Claro, Brazil
| | - Pinar Ustaoglu
- grid.6572.60000 0004 1936 7486School of Biosciences, College of Life and Environmental Sciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT UK
| | - Kamila Derecka
- grid.4563.40000 0004 1936 8868School of Biosciences, University of Nottingham, Sutton Bonington, Loughborough, LE12 5RD UK
| | - Ian C. W. Hardy
- grid.4563.40000 0004 1936 8868School of Biosciences, University of Nottingham, Sutton Bonington, Loughborough, LE12 5RD UK
| | - Thaisa C. Roat
- grid.410543.70000 0001 2188 478XInstitute of Biosciences, São Paulo State University (Unesp), Rio Claro, Brazil
| | - Osmar Malaspina
- grid.410543.70000 0001 2188 478XInstitute of Biosciences, São Paulo State University (Unesp), Rio Claro, Brazil
| | - Nigel Mongan
- grid.4563.40000 0004 1936 8868School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington, Loughborough, LE12 5RD UK
| | - Reinhard Stöger
- grid.4563.40000 0004 1936 8868School of Biosciences, University of Nottingham, Sutton Bonington, Loughborough, LE12 5RD UK
| | - Matthias Soller
- grid.6572.60000 0004 1936 7486School of Biosciences, College of Life and Environmental Sciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT UK
| |
Collapse
|
31
|
Fabre B, Combier JP, Plaza S. Recent advances in mass spectrometry-based peptidomics workflows to identify short-open-reading-frame-encoded peptides and explore their functions. Curr Opin Chem Biol 2021; 60:122-130. [PMID: 33401134 DOI: 10.1016/j.cbpa.2020.12.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 11/26/2020] [Accepted: 12/03/2020] [Indexed: 12/12/2022]
Abstract
Short open reading frame (sORF)-encoded polypeptides (SEPs) have recently emerged as key regulators of major cellular processes. Computational methods for the annotation of sORFs combined with transcriptomics and ribosome profiling approaches predicted the existence of tens of thousands of SEPs across the kingdom of life. Although, we still lack unambiguous evidence for most of them. The method of choice to validate the expression of SEPs is mass spectrometry (MS)-based peptidomics. Peptides are less abundant than proteins, which tends to hinder their detection. Therefore, optimization and enrichment methods are necessary to validate the existence of SEPs. In this article, we discuss the challenges for the detection of SEPs by MS and recent developments of biochemical approaches applied to the study of these peptides. We detail the advances made in the different key steps of a typical peptidomics workflow and highlight possible alternatives that have not been explored yet.
Collapse
Affiliation(s)
- Bertrand Fabre
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, CNRS, 31320, Auzeville-Tolosane, France.
| | - Jean-Philippe Combier
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, CNRS, 31320, Auzeville-Tolosane, France
| | - Serge Plaza
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, CNRS, 31320, Auzeville-Tolosane, France
| |
Collapse
|
32
|
Casimiro-Soriguer CS, Rigual MM, Brokate-Llanos AM, Muñoz MJ, Garzón A, Pérez-Pulido AJ, Jimenez J. Using AnABlast for intergenic sORF prediction in the Caenorhabditis elegans genome. Bioinformatics 2020; 36:4827-4832. [PMID: 32614398 PMCID: PMC7723330 DOI: 10.1093/bioinformatics/btaa608] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 06/21/2020] [Accepted: 06/23/2020] [Indexed: 11/29/2022] Open
Abstract
Motivation Short bioactive peptides encoded by small open reading frames (sORFs) play important roles in eukaryotes. Bioinformatics prediction of ORFs is an early step in a genome sequence analysis, but sORFs encoding short peptides, often using non-AUG initiation codons, are not easily discriminated from false ORFs occurring by chance. Results AnABlast is a computational tool designed to highlight putative protein-coding regions in genomic DNA sequences. This protein-coding finder is independent of ORF length and reading frame shifts, thus making of AnABlast a potentially useful tool to predict sORFs. Using this algorithm, here, we report the identification of 82 putative new intergenic sORFs in the Caenorhabditis elegans genome. Sequence similarity, motif presence, expression data and RNA interference experiments support that the underlined sORFs likely encode functional peptides, encouraging the use of AnABlast as a new approach for the accurate prediction of intergenic sORFs in annotated eukaryotic genomes. Availability and implementation AnABlast is freely available at http://www.bioinfocabd.upo.es/ab/. The C.elegans genome browser with AnABlast results, annotated genes and all data used in this study is available at http://www.bioinfocabd.upo.es/celegans. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- C S Casimiro-Soriguer
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - M M Rigual
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - A M Brokate-Llanos
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - M J Muñoz
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - A Garzón
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - A J Pérez-Pulido
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| | - J Jimenez
- Centro Andaluz de Biología del Desarrollo (CABD, UPO-CSIC), Universidad Pablo de Olavide, 41013 Sevilla, Spain
| |
Collapse
|
33
|
Cardon T, Fournier I, Salzet M. Shedding Light on the Ghost Proteome. Trends Biochem Sci 2020; 46:239-250. [PMID: 33246829 DOI: 10.1016/j.tibs.2020.10.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 10/21/2020] [Accepted: 10/22/2020] [Indexed: 01/19/2023]
Abstract
Conventionally, eukaryotic mRNAs were thought to be monocistronic, leading to the translation of a single protein. However, large-scale proteomics has led to the identification of proteins translated from alternative open reading frames (AltORFs) in mRNAs. AltORFs are found in addition to predicted reference ORFs and noncoding RNA. Alternative proteins are not represented in the conventional protein databases, and this 'Ghost proteome' was not considered until recently. Some of these proteins are functional, and there is growing evidence that they are involved in central functions in physiological and physiopathological contexts. Here, we review how this Ghost proteome fills the gap in our understanding of signaling pathways, establishes new markers of pathologies, and highlights therapeutic targets.
Collapse
Affiliation(s)
- Tristan Cardon
- Laboratoire Protéomique, Réponse Inflammatoire Spectrométrie de Masse (PRISM), Inserm U1192, University of Lille, CHU Lille, F-59000 Lille, France.
| | - Isabelle Fournier
- Laboratoire Protéomique, Réponse Inflammatoire Spectrométrie de Masse (PRISM), Inserm U1192, University of Lille, CHU Lille, F-59000 Lille, France; Institut Universitaire de France, Paris, France.
| | - Michel Salzet
- Laboratoire Protéomique, Réponse Inflammatoire Spectrométrie de Masse (PRISM), Inserm U1192, University of Lille, CHU Lille, F-59000 Lille, France; Institut Universitaire de France, Paris, France.
| |
Collapse
|
34
|
Leblanc S, Brunet MA. Modelling of pathogen-host systems using deeper ORF annotations and transcriptomics to inform proteomics analyses. Comput Struct Biotechnol J 2020; 18:2836-2850. [PMID: 33133425 PMCID: PMC7585943 DOI: 10.1016/j.csbj.2020.10.010] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 10/07/2020] [Accepted: 10/08/2020] [Indexed: 01/08/2023] Open
Abstract
The Zika virus is a flavivirus that can cause fulminant outbreaks and lead to Guillain-Barré syndrome, microcephaly and fetal demise. Like other flaviviruses, the Zika virus is transmitted by mosquitoes and provokes neurological disorders. Despite its risk to public health, no antiviral nor vaccine are currently available. In the recent years, several studies have set to identify human host proteins interacting with Zika viral proteins to better understand its pathogenicity. Yet these studies used standard human protein sequence databases. Such databases rely on genome annotations, which enforce a minimal open reading frame (ORF) length criterion. An ever-increasing number of studies have demonstrated the shortcomings of such annotation, which overlooks thousands of functional ORFs. Here we show that the use of a customized database including currently non-annotated proteins led to the identification of 4 alternative proteins as interactors of the viral capsid and NS4A proteins. Furthermore, 12 alternative proteins were identified in the proteome profiling of Zika infected monocytes, one of which was significantly up-regulated. This study presents a computational framework for the re-analysis of proteomics datasets to better investigate the viral-host protein interplays upon infection with the Zika virus.
Collapse
Key Words
- AP-MS, affinity-purification mass spectrometry
- Alternative ORFs
- DEP, differentially expressed proteins
- FDR, false discovery rate
- FPKM, fragments per kilobase of exon model per million reads mapped
- Flavivirus
- HCIP, highly confident interacting proteins
- HCMV, human cytomegalovirus
- LFQ, label free quantification
- MS, mass spectrometry
- ORF, open reading frame
- PSM, peptide spectrum match
- Protein network
- Proteogenomics
- Proteome profiling
- ZIKV, Zika virus
- Zika
- altProt, alternative protein
- ncRNA, non-coding RNA
- sORF, small open reading frame
Collapse
Affiliation(s)
- Sebastien Leblanc
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Québec, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Canada
| | - Marie A. Brunet
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Québec, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Canada
| |
Collapse
|
35
|
Wang B, Hao J, Pan N, Wang Z, Chen Y, Wan C. Identification and analysis of small proteins and short open reading frame encoded peptides in Hep3B cell. J Proteomics 2020; 230:103965. [PMID: 32891891 DOI: 10.1016/j.jprot.2020.103965] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Revised: 06/25/2020] [Accepted: 08/31/2020] [Indexed: 02/05/2023]
Abstract
The small proteins and short open reading frames encoded peptides (SEPs) are of fundamental importance because of their essential roles in biological processes. However, the annotation or identification of them is challenging, in part owing to the limitation of the traditional genome annotation pipeline and their inherent characteristics of low abundance and low molecular weight. To discover and characterize SEPs in Hep3B cell line, we developed an optimized peptidomic assay by combining different peptide extraction and separation methods. The organic solvent precipitation method in peptidomic showed promotion in the enrichment of low molecular proteins or peptides, and the data clearly showed a beneficial effect from the reduction of sample complexity, resulting in high-quality MS/MS spectra. Furthermore, different strategies exhibited good complementarity in improving the total amount of small proteins and their sequence coverage. In total, 1192 proteins within less than 100 amino acids were identified, including 271 newly discovered SEPs that been annotated in the OpenProt database and 147 SEPs of them encoded from ncRNA or lincRNA. Results in this work provide robust evidence to date that the human proteome is more complicated than previously appreciated, and this will be a benefit to discoveries of proteins without function annotation. SIGNIFICANCE: In this work, methods were optimized to identify SEPs in Hep3B. The organic solvent precipitation presents promotion in enrichment of low molecular proteins or peptides, and the data clearly showed a beneficial effect from the reduction of sample complexity, resulting in high quality MS/MS spectra. Different strategies exhibited good complementarity in improving total amount of small proteins and their sequence coverage. In total, 1192 proteins within less than 100 amino acids were identified, including 271 newly discovered SEPs that been annotated in the OpenProt database and 147 SEPs of them encoded from ncRNA or lincRNA. Furthermore, 22 SEPs generated from the uORF may has potential effect in translation control, and 149 newly identified SEPs have known functional domains or cross-species conservation. Results in this work present robust evidence for the coding potential of the ignored region of human genomes and may provide additional insights into tumor biology.
Collapse
Affiliation(s)
- Bing Wang
- Hubei Key Lab of Genetic Regulation & Integrative Biology, School of Life Sciences, Central China Normal University, No. 152 Luoyu Road, Wuhan 430079, PR China
| | - Junhui Hao
- Hubei Key Lab of Genetic Regulation & Integrative Biology, School of Life Sciences, Central China Normal University, No. 152 Luoyu Road, Wuhan 430079, PR China
| | - Ni Pan
- Hubei Key Lab of Genetic Regulation & Integrative Biology, School of Life Sciences, Central China Normal University, No. 152 Luoyu Road, Wuhan 430079, PR China
| | - Zhiwei Wang
- Hubei Key Lab of Genetic Regulation & Integrative Biology, School of Life Sciences, Central China Normal University, No. 152 Luoyu Road, Wuhan 430079, PR China
| | - Yinxuan Chen
- Hubei Key Lab of Genetic Regulation & Integrative Biology, School of Life Sciences, Central China Normal University, No. 152 Luoyu Road, Wuhan 430079, PR China
| | - Cuihong Wan
- Hubei Key Lab of Genetic Regulation & Integrative Biology, School of Life Sciences, Central China Normal University, No. 152 Luoyu Road, Wuhan 430079, PR China.
| |
Collapse
|
36
|
Rochette L, Meloux A, Zeller M, Cottin Y, Vergely C. Role of humanin, a mitochondrial-derived peptide, in cardiovascular disorders. Arch Cardiovasc Dis 2020; 113:564-571. [PMID: 32680738 DOI: 10.1016/j.acvd.2020.03.020] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2019] [Revised: 03/20/2020] [Accepted: 03/20/2020] [Indexed: 11/29/2022]
Abstract
The mitochondria produce specific peptides-mitochondrial-derived peptides-that mediate the transcriptional stress response by their translocation into the nucleus and interaction with deoxyribonucleic acid. Mitochondrial-derived peptides are regulators of metabolism. This class of peptides comprises humanin, mitochondrial open reading frame of the 12S ribosomal ribonucleic acid type c (MOTS-c) and small humanin-like peptides (SHLPs). Humanin inhibits mitochondrial complex 1 activity and limits the level of oxidative stress in the cell. Data show that mitochondrial-derived peptides have a role in improving metabolic diseases, such as type 2 diabetes. Perhaps humanin can be used as a marker for mitochondrial function in cardiovascular disease or as a pharmacological strategy in patients with endothelial dysfunction. The goal of this review is to discuss the newly emerging functions of humanin, and its biological role in cardiovascular disorders.
Collapse
Affiliation(s)
- Luc Rochette
- Equipe d'Accueil (EA 7460), Physiopathologie et Epidémiologie Cérébro-Cardiovasculaires (PEC2), Université de Bourgogne - Franche Comté, Faculté des Sciences de Santé, 7, boulevard Jeanne-d'Arc, 21000 Dijon, France.
| | - Alexandre Meloux
- Equipe d'Accueil (EA 7460), Physiopathologie et Epidémiologie Cérébro-Cardiovasculaires (PEC2), Université de Bourgogne - Franche Comté, Faculté des Sciences de Santé, 7, boulevard Jeanne-d'Arc, 21000 Dijon, France
| | - Marianne Zeller
- Equipe d'Accueil (EA 7460), Physiopathologie et Epidémiologie Cérébro-Cardiovasculaires (PEC2), Université de Bourgogne - Franche Comté, Faculté des Sciences de Santé, 7, boulevard Jeanne-d'Arc, 21000 Dijon, France
| | - Yves Cottin
- Equipe d'Accueil (EA 7460), Physiopathologie et Epidémiologie Cérébro-Cardiovasculaires (PEC2), Université de Bourgogne - Franche Comté, Faculté des Sciences de Santé, 7, boulevard Jeanne-d'Arc, 21000 Dijon, France; Department of Cardiology, CHU Dijon Bourgogne, 21000 Dijon, France
| | - Catherine Vergely
- Equipe d'Accueil (EA 7460), Physiopathologie et Epidémiologie Cérébro-Cardiovasculaires (PEC2), Université de Bourgogne - Franche Comté, Faculté des Sciences de Santé, 7, boulevard Jeanne-d'Arc, 21000 Dijon, France
| |
Collapse
|
37
|
Patraquim P, Mumtaz MAS, Pueyo JI, Aspden JL, Couso JP. Developmental regulation of canonical and small ORF translation from mRNAs. Genome Biol 2020; 21:128. [PMID: 32471506 PMCID: PMC7260771 DOI: 10.1186/s13059-020-02011-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 04/08/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND Ribosomal profiling has revealed the translation of thousands of sequences outside annotated protein-coding genes, including small open reading frames of less than 100 codons, and the translational regulation of many genes. Here we present an improved version of Poly-Ribo-Seq and apply it to Drosophila melanogaster embryos to extend the catalog of in vivo translated small ORFs, and to reveal the translational regulation of both small and canonical ORFs from mRNAs across embryogenesis. RESULTS We obtain highly correlated samples across five embryonic stages, with nearly 500 million putative ribosomal footprints mapped to mRNAs, and compare them to existing Ribo-Seq and proteomic data. Our analysis reveals, for the first time in Drosophila, footprints mapping to codons in a phased pattern, the hallmark of productive translation. We propose a simple binomial probability metric to ascertain translation probability. Our results also reveal reproducible ribosomal binding apparently not resulting in productive translation. This non-productive ribosomal binding seems to be especially prevalent amongst upstream short ORFs located in the 5' mRNA leaders, and amongst canonical ORFs during the activation of the zygotic translatome at the maternal-to zygotic transition. CONCLUSIONS We suggest that this non-productive ribosomal binding might be due to cis-regulatory ribosomal binding and to defective ribosomal scanning of ORFs outside periods of productive translation. Our results are compatible with the main function of upstream short ORFs being to buffer the translation of canonical canonical ORFs; and show that, in general, small ORFs in mRNAs display markers compatible with an evolutionary transitory state towards full coding function.
Collapse
Affiliation(s)
- Pedro Patraquim
- Centro Andaluz de Biologia del Desarrollo, CSIC-UPO, Seville, Spain
| | | | | | - Julie Louise Aspden
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, LS2 9JT, UK
| | - Juan-Pablo Couso
- Centro Andaluz de Biologia del Desarrollo, CSIC-UPO, Seville, Spain. .,Previous address: Brighton and Sussex Medical School, Brighton, East Sussex, UK.
| |
Collapse
|
38
|
Brunet MA, Brunelle M, Lucier JF, Delcourt V, Levesque M, Grenier F, Samandi S, Leblanc S, Aguilar JD, Dufour P, Jacques JF, Fournier I, Ouangraoua A, Scott MS, Boisvert FM, Roucou X. OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes. Nucleic Acids Res 2020; 47:D403-D410. [PMID: 30299502 PMCID: PMC6323990 DOI: 10.1093/nar/gky936] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Accepted: 10/04/2018] [Indexed: 01/06/2023] Open
Abstract
Advances in proteomics and sequencing have highlighted many non-annotated open reading frames (ORFs) in eukaryotic genomes. Genome annotations, cornerstones of today's research, mostly rely on protein prior knowledge and on ab initio prediction algorithms. Such algorithms notably enforce an arbitrary criterion of one coding sequence (CDS) per transcript, leading to a substantial underestimation of the coding potential of eukaryotes. Here, we present OpenProt, the first database fully endorsing a polycistronic model of eukaryotic genomes to date. OpenProt contains all possible ORFs longer than 30 codons across 10 species, and cumulates supporting evidence such as protein conservation, translation and expression. OpenProt annotates all known proteins (RefProts), novel predicted isoforms (Isoforms) and novel predicted proteins from alternative ORFs (AltProts). It incorporates cutting-edge algorithms to evaluate protein orthology and re-interrogate publicly available ribosome profiling and mass spectrometry datasets, supporting the annotation of thousands of predicted ORFs. The constantly growing database currently cumulates evidence from 87 ribosome profiling and 114 mass spectrometry studies from several species, tissues and cell lines. All data is freely available and downloadable from a web platform (www.openprot.org) supporting a genome browser and advanced queries for each species. Thus, OpenProt enables a more comprehensive landscape of eukaryotic genomes’ coding potential.
Collapse
Affiliation(s)
- Marie A Brunet
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université de Lille, F-59000 Lille, France
| | - Mylène Brunelle
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université de Lille, F-59000 Lille, France
| | - Jean-François Lucier
- Center for Computational Science, Université de Sherbrooke, Sherbrooke, Québec, Canada.,Biology Department, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Vivian Delcourt
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université de Lille, F-59000 Lille, France.,INSERM U1192, Laboratoire Protéomique, Réponse Inflammatoire & Spectrométrie de Masse (PRISM), Université de Lille, F-59000 Lille, France
| | - Maxime Levesque
- Center for Computational Science, Université de Sherbrooke, Sherbrooke, Québec, Canada.,Biology Department, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Frédéric Grenier
- Center for Computational Science, Université de Sherbrooke, Sherbrooke, Québec, Canada.,Biology Department, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Sondos Samandi
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université de Lille, F-59000 Lille, France
| | - Sébastien Leblanc
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Jean-David Aguilar
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Pascal Dufour
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Jean-Francois Jacques
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université de Lille, F-59000 Lille, France
| | - Isabelle Fournier
- INSERM U1192, Laboratoire Protéomique, Réponse Inflammatoire & Spectrométrie de Masse (PRISM), Université de Lille, F-59000 Lille, France
| | - Aida Ouangraoua
- Informatics Department, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | - Michelle S Scott
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada
| | | | - Xavier Roucou
- Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Québec, Canada.,PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université de Lille, F-59000 Lille, France
| |
Collapse
|
39
|
Liu H, Zhou X, Yuan M, Zhou S, Huang YE, Hou F, Song X, Wang L, Jiang W. ncEP: A Manually Curated Database for Experimentally Validated ncRNA-encoded Proteins or Peptides. J Mol Biol 2020; 432:3364-3368. [DOI: 10.1016/j.jmb.2020.02.022] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 02/12/2020] [Accepted: 02/12/2020] [Indexed: 10/24/2022]
|
40
|
Kubatova N, Pyper DJ, Jonker HRA, Saxena K, Remmel L, Richter C, Brantl S, Evguenieva‐Hackenberg E, Hess WR, Klug G, Marchfelder A, Soppa J, Streit W, Mayzel M, Orekhov VY, Fuxreiter M, Schmitz RA, Schwalbe H. Rapid Biophysical Characterization and NMR Spectroscopy Structural Analysis of Small Proteins from Bacteria and Archaea. Chembiochem 2020; 21:1178-1187. [PMID: 31705614 PMCID: PMC7217052 DOI: 10.1002/cbic.201900677] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Indexed: 01/08/2023]
Abstract
Proteins encoded by small open reading frames (sORFs) have a widespread occurrence in diverse microorganisms and can be of high functional importance. However, due to annotation biases and their technically challenging direct detection, these small proteins have been overlooked for a long time and were only recently rediscovered. The currently rapidly growing number of such proteins requires efficient methods to investigate their structure-function relationship. Herein, a method is presented for fast determination of the conformational properties of small proteins. Their small size makes them perfectly amenable for solution-state NMR spectroscopy. NMR spectroscopy can provide detailed information about their conformational states (folded, partially folded, and unstructured). In the context of the priority program on small proteins funded by the German research foundation (SPP2002), 27 small proteins from 9 different bacterial and archaeal organisms have been investigated. It is found that most of these small proteins are unstructured or partially folded. Bioinformatics tools predict that some of these unstructured proteins can potentially fold upon complex formation. A protocol for fast NMR spectroscopy structure elucidation is described for the small proteins that adopt a persistently folded structure by implementation of new NMR technologies, including automated resonance assignment and nonuniform sampling in combination with targeted acquisition.
Collapse
Affiliation(s)
- Nina Kubatova
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| | - Dennis J. Pyper
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| | - Hendrik R. A. Jonker
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| | - Krishna Saxena
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| | - Laura Remmel
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| | - Christian Richter
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| | - Sabine Brantl
- AG BakteriengenetikMatthias-Schleiden-InstitutPhilosophenweg 1207743JenaGermany
| | - Elena Evguenieva‐Hackenberg
- Institute for Microbiology and Molecular BiologyJustus Liebig University GiessenHeinrich-Buff-Ring 2635392GiessenGermany
| | - Wolfgang R. Hess
- Faculty of Biology, Genetics and Experimental BioinformaticsAlbert Ludwigs University FreiburgSchänzlestrasse 179104FreiburgGermany
| | - Gabriele Klug
- Institute for Microbiology and Molecular BiologyJustus Liebig University GiessenHeinrich-Buff-Ring 2635392GiessenGermany
| | | | - Jörg Soppa
- Institute for Molecular BiosciencesJohann Wolfgang Goethe UniversityMax-von-Laue-Strasse 960438Frankfurt am MainGermany
| | - Wolfgang Streit
- Department of Microbiology and BiotechnologyUniversity of HamburgOhnhorststrasse 1822609HamburgGermany
| | - Maxim Mayzel
- Swedish NMR CentreUniversity of GothenburgP. O. Box 46540530GothenburgSweden
| | - Vladislav Y. Orekhov
- Swedish NMR CentreUniversity of GothenburgP. O. Box 46540530GothenburgSweden
- Department of Chemistry and Molecular BiologyUniversity of GothenburgKemigården 441296GothenburgSweden
| | - Monika Fuxreiter
- MTA-DE Laboratory of Protein DynamicsDepartment of Biochemistry and Molecular BiologyUniversity of DebrecenNagyerdei krt 984032DebrecenHungary
| | - Ruth A. Schmitz
- Institute for General MicrobiologyChristian Albrechts University KielAm Botanischen Garten 1–924118KielGermany
| | - Harald Schwalbe
- Institute for Organic Chemistry and Chemical BiologyCenter for Biomolecular Magnetic Resonance (BMRZ)Johann Wolfgang Goethe UniversityMax-von-Laue-Strasse 760438Frankfurt/MainGermany
| |
Collapse
|
41
|
Orr MW, Mao Y, Storz G, Qian SB. Alternative ORFs and small ORFs: shedding light on the dark proteome. Nucleic Acids Res 2020; 48:1029-1042. [PMID: 31504789 DOI: 10.1093/nar/gkz734] [Citation(s) in RCA: 146] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Revised: 08/03/2019] [Accepted: 08/15/2019] [Indexed: 02/06/2023] Open
Abstract
Traditional annotation of protein-encoding genes relied on assumptions, such as one open reading frame (ORF) encodes one protein and minimal lengths for translated proteins. With the serendipitous discoveries of translated ORFs encoded upstream and downstream of annotated ORFs, from alternative start sites nested within annotated ORFs and from RNAs previously considered noncoding, it is becoming clear that these initial assumptions are incorrect. The findings have led to the realization that genetic information is more densely coded and that the proteome is more complex than previously anticipated. As such, interest in the identification and characterization of the previously ignored 'dark proteome' is increasing, though we note that research in eukaryotes and bacteria has largely progressed in isolation. To bridge this gap and illustrate exciting findings emerging from studies of the dark proteome, we highlight recent advances in both eukaryotic and bacterial cells. We discuss progress in the detection of alternative ORFs as well as in the understanding of functions and the regulation of their expression and posit questions for future work.
Collapse
Affiliation(s)
- Mona Wu Orr
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892, USA
| | - Yuanhui Mao
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Gisela Storz
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892, USA
| | - Shu-Bing Qian
- Division of Nutritional Sciences, Cornell University, Ithaca, NY 14853, USA
| |
Collapse
|
42
|
Ruiz-Orera J, Villanueva-Cañas JL, Albà MM. Evolution of new proteins from translated sORFs in long non-coding RNAs. Exp Cell Res 2020; 391:111940. [PMID: 32156600 DOI: 10.1016/j.yexcr.2020.111940] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2019] [Revised: 02/26/2020] [Accepted: 03/02/2020] [Indexed: 01/07/2023]
Abstract
High throughput RNA sequencing techniques have revealed that a large fraction of the genome is transcribed into long non-coding RNAs (lncRNAs). Unlike canonical protein-coding genes, lncRNAs do not contain long open reading frames (ORFs) and tend to be poorly conserved across species. However, many of them contain small ORFs (sORFs) that exhibit translation signatures according to ribosome profiling or proteomics data. These sORFs are a source of putative novel proteins; some of them may confer a selective advantage and be maintained over time, a process known as de novo gene birth. Here we review the mechanisms by which randomly occurring sORFs in lncRNAs can become new functional proteins.
Collapse
Affiliation(s)
- Jorge Ruiz-Orera
- Cardiovascular and Metabolic Sciences, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin, Germany
| | | | - M Mar Albà
- Evolutionary Genomics Group, Research Programme in Biomedical Informatics, Hospital Del Mar Research Institute (IMIM), Universitat Pompeu Fabra (UPF), Barcelona, Spain; Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, 08010, Spain.
| |
Collapse
|
43
|
Makarewich CA. The hidden world of membrane microproteins. Exp Cell Res 2020; 388:111853. [PMID: 31978386 DOI: 10.1016/j.yexcr.2020.111853] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 01/03/2020] [Accepted: 01/14/2020] [Indexed: 12/26/2022]
Abstract
Proteins are critical components of biological membranes and play key roles in many essential cellular processes. Membrane proteins are a structurally and functionally diverse family of proteins that have recently expanded to include a number of newly discovered tiny proteins called microproteins, or micropeptides. These microproteins are generated from small open reading frames, which produce protein products that are less than 100 amino acids in length. While not all microproteins are membrane proteins, this review will focus specifically on this subclass to highlight some of the important biological activities that have been ascribed to these molecules and to emphasize their promise as exciting new players in membrane biology.
Collapse
Affiliation(s)
- Catherine A Makarewich
- The Heart Institute and Division of Molecular Cardiovascular Biology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, United States; Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, United States.
| |
Collapse
|
44
|
Bhatta A, Atianand M, Jiang Z, Crabtree J, Blin J, Fitzgerald KA. A Mitochondrial Micropeptide Is Required for Activation of the Nlrp3 Inflammasome. THE JOURNAL OF IMMUNOLOGY 2019; 204:428-437. [PMID: 31836654 DOI: 10.4049/jimmunol.1900791] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 11/10/2019] [Indexed: 01/01/2023]
Abstract
Functional peptides encoded by short open reading frames are emerging as important mediators of fundamental biological processes. In this study, we identified a micropeptide produced from a putative long noncoding RNA (lncRNAs) that is important in controlling innate immunity. By studying lncRNAs in mice macrophages, we identified lncRNA 1810058I24Rik, which was downregulated in both human and murine myeloid cells exposed to LPS as well as other TLR ligands and inflammatory cytokines. Analysis of lncRNA 1810058I24Rik subcellular localization revealed that this transcript was localized in the cytosol, prompting us to evaluate its coding potential. In vitro translation with 35S-labeled methionine resulted in translation of a 47 aa micropeptide. Microscopy and subcellular fractionation studies in macrophages demonstrated endogenous expression of this peptide on the mitochondrion. We thus named this gene mitochondrial micropeptide-47 (Mm47). Crispr-Cas9-mediated deletion of Mm47, as well as small interfering RNA studies in mice primary macrophages, showed that the transcriptional response downstream of TLR4 was intact in cells lacking Mm47. In contrast, Mm47-deficient or knockdown cells were compromised for Nlrp3 inflammasome responses. Activation of Nlrc4 or Aim2 inflammasomes were intact in cells lacking Mm47. This study therefore identifies, to our knowledge, a novel mitochondrial micropeptide Mm47 that is required for the activation of the Nlrp3 inflammasome. This work further highlights the functional activity of short open reading frame-encoded peptides and underscores their importance in innate immunity.
Collapse
Affiliation(s)
- Ankit Bhatta
- Program in Innate Immunity, Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01605; and
| | - Maninjay Atianand
- Department of Immunology, University of Pittsburgh, Pittsburgh, PA 15261
| | - Zhaozhao Jiang
- Program in Innate Immunity, Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01605; and
| | - Juliet Crabtree
- Program in Innate Immunity, Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01605; and
| | - Juliana Blin
- Program in Innate Immunity, Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01605; and
| | - Katherine A Fitzgerald
- Program in Innate Immunity, Department of Medicine, University of Massachusetts Medical School, Worcester, MA 01605; and
| |
Collapse
|
45
|
Monteuuis G, Miścicka A, Świrski M, Zenad L, Niemitalo O, Wrobel L, Alam J, Chacinska A, Kastaniotis AJ, Kufel J. Non-canonical translation initiation in yeast generates a cryptic pool of mitochondrial proteins. Nucleic Acids Res 2019; 47:5777-5791. [PMID: 31216041 PMCID: PMC6582344 DOI: 10.1093/nar/gkz301] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 04/12/2019] [Accepted: 04/16/2019] [Indexed: 12/15/2022] Open
Abstract
Utilization of non-AUG alternative translation start sites is most common in bacteria and viruses, but it has been also reported in other organisms. This phenomenon increases proteome complexity by allowing expression of multiple protein isoforms from a single gene. In Saccharomyces cerevisiae, a few described cases concern proteins that are translated from upstream near-cognate start codons as N-terminally extended variants that localize to mitochondria. Using bioinformatics tools, we provide compelling evidence that in yeast the potential for producing alternative protein isoforms by non-AUG translation initiation is much more prevalent than previously anticipated and may apply to as many as a few thousand proteins. Several hundreds of candidates are predicted to gain a mitochondrial targeting signal (MTS), generating an unrecognized pool of mitochondrial proteins. We confirmed mitochondrial localization of a subset of proteins previously not identified as mitochondrial, whose standard forms do not carry an MTS. Our data highlight the potential of non-canonical translation initiation in expanding the capacity of the mitochondrial proteome and possibly also other cellular features.
Collapse
Affiliation(s)
- Geoffray Monteuuis
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, FIN-90014 Finland
| | - Anna Miścicka
- Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, 02-106 Warsaw, Poland
| | - Michał Świrski
- Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, 02-106 Warsaw, Poland
| | - Lounis Zenad
- Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, 02-106 Warsaw, Poland
| | - Olli Niemitalo
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, FIN-90014 Finland
| | - Lidia Wrobel
- International Institute of Molecular and Cell Biology, 02-109 Warsaw, Poland
| | - Jahangir Alam
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, FIN-90014 Finland
| | - Agnieszka Chacinska
- International Institute of Molecular and Cell Biology, 02-109 Warsaw, Poland.,Centre of New Technologies, University of Warsaw, 02-097 Warsaw, Poland
| | - Alexander J Kastaniotis
- Faculty of Biochemistry and Molecular Medicine, University of Oulu, P.O. Box 5400, FIN-90014 Finland
| | - Joanna Kufel
- Institute of Genetics and Biotechnology, Faculty of Biology, University of Warsaw, 02-106 Warsaw, Poland
| |
Collapse
|
46
|
Tobias-Santos V, Guerra-Almeida D, Mury F, Ribeiro L, Berni M, Araujo H, Logullo C, Feitosa NM, de Souza-Menezes J, Pessoa Costa E, Nunes-da-Fonseca R. Multiple Roles of the Polycistronic Gene Tarsal-less/Mille-Pattes/Polished-Rice During Embryogenesis of the Kissing Bug Rhodnius prolixus. Front Ecol Evol 2019. [DOI: 10.3389/fevo.2019.00379] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
47
|
Weaver J, Mohammad F, Buskirk AR, Storz G. Identifying Small Proteins by Ribosome Profiling with Stalled Initiation Complexes. mBio 2019; 10:e02819-18. [PMID: 30837344 PMCID: PMC6401488 DOI: 10.1128/mbio.02819-18] [Citation(s) in RCA: 108] [Impact Index Per Article: 21.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2018] [Accepted: 01/24/2019] [Indexed: 11/20/2022] Open
Abstract
Small proteins consisting of 50 or fewer amino acids have been identified as regulators of larger proteins in bacteria and eukaryotes. Despite the importance of these molecules, the total number of small proteins remains unknown because conventional annotation pipelines usually exclude small open reading frames (smORFs). We previously identified several dozen small proteins in the model organism Escherichia coli using theoretical bioinformatic approaches based on sequence conservation and matches to canonical ribosome binding sites. Here, we present an empirical approach for discovering new proteins, taking advantage of recent advances in ribosome profiling in which antibiotics are used to trap newly initiated 70S ribosomes at start codons. This approach led to the identification of many novel initiation sites in intergenic regions in E. coli We tagged 41 smORFs on the chromosome and detected protein synthesis for all but three. Not only are the corresponding genes intergenic but they are also found antisense to other genes, in operons, and overlapping other open reading frames (ORFs), some impacting the translation of larger downstream genes. These results demonstrate the utility of this method for identifying new genes, regardless of their genomic context.IMPORTANCE Proteins comprised of 50 or fewer amino acids have been shown to interact with and modulate the functions of larger proteins in a range of organisms. Despite the possible importance of small proteins, the true prevalence and capabilities of these regulators remain unknown as the small size of the proteins places serious limitations on their identification, purification, and characterization. Here, we present a ribosome profiling approach with stalled initiation complexes that led to the identification of 38 new small proteins.
Collapse
Affiliation(s)
- Jeremy Weaver
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland, USA
| | - Fuad Mohammad
- Department of Molecular Biology and Genetics, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Allen R Buskirk
- Department of Molecular Biology and Genetics, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Gisela Storz
- Division of Molecular and Cellular Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, Maryland, USA
| |
Collapse
|
48
|
LINC00116 codes for a mitochondrial peptide linking respiration and lipid metabolism. Proc Natl Acad Sci U S A 2019; 116:4940-4945. [PMID: 30796188 PMCID: PMC6421467 DOI: 10.1073/pnas.1809105116] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Short peptides are encoded in genomes of all organisms and have important functions. Due to the small size of such open reading frames, they are frequently overlooked by automatic genome annotation. We investigated the gene that was misannotated as long noncoding RNA LINC00116 and demonstrated that this gene codes for a 56-amino-acid-long peptide, Mtln, which is localized in mitochondria. Inactivation of the Mtln coding gene leads to reduction of oxygen consumption attributed to respiratory complex I activity and perturbs lipid composition of the cell. This influence is mediated by Mtln interaction with NADH-dependent cytochrome b5 reductase. Disruption of the mitochondrial localization of the latter phenocopies Mtln inactivation. Genes coding for small peptides have been frequently misannotated as long noncoding RNA (lncRNA) genes. Here we have demonstrated that one such transcript is translated into a 56-amino-acid-long peptide conserved in chordates, corroborating the work published while this manuscript was under review. The Mtln peptide could be detected in mitochondria of mouse cell lines and tissues. In line with its mitochondrial localization, lack of the Mtln decreases the activity of mitochondrial respiratory chain complex I. Unlike the integral components and assembly factors of NADH:ubiquinone oxidoreductase, Mtln does not alter its enzymatic activity directly. Interaction of Mtln with NADH-dependent cytochrome b5 reductase stimulates complex I functioning most likely by providing a favorable lipid composition of the membrane. Study of Mtln illuminates the importance of small peptides, whose genes might frequently be misannotated as lncRNAs, for the control of vitally important cellular processes.
Collapse
|
49
|
Abstract
INTRODUCTION Small open reading frames (sORFs) with potential protein-coding capacity have been disclosed in various transcripts, including long noncoding RNAs (LncRNAs), mRNAs (5'-upstream, coding domain, and 3'-downstream), circular RNAs, pri-miRNAs, and ribosomal RNAs (rRNAs). Recent characterization of several sORF-encoded peptides (SEPs or micropeptides) revealed their important roles in many fundamental biological processes in a broad range of species from yeast to human. The success in the mining of micropeptides attributes to the advanced bioinformatics and high-throughput sequencing techniques. Areas covered: sORFs and SEPs were overlooked for their tiny size and the difficulty of identification by bioinformatics analyses. With more and more sORFs and SEPs have been identified, this field has attracted more attention. This review covers recent advances in the strategies for the detection and identification of sORFs and SEPs. Expert commentary: The advantages and drawbacks of the strategies for detection and identification of sORFs and SEPs are discussed, as well as the techniques that are used to decipher the roles of micropeptides in organisms are described.
Collapse
Affiliation(s)
- Xinqiang Yin
- a The Engineering Research Center of Synthetic Polypeptide Drug Discovery and Evaluation of Jiangsu Province , China Pharmaceutical University , Nanjing , China.,b The Basic Medical School , North Sichuan Medical College , Nanchong , China
| | - Yuanyuan Jing
- c Department of Preventive Medicine , North Sichuan Medical College , Nanchong , China
| | - Hanmei Xu
- a The Engineering Research Center of Synthetic Polypeptide Drug Discovery and Evaluation of Jiangsu Province , China Pharmaceutical University , Nanjing , China.,d State Key Laboratory of Natural Medicines, Ministry of Education , China Pharmaceutical University , Nanjing , China
| |
Collapse
|
50
|
Abstract
The life span of cancer patients can be prolonged with appropriate therapies if detected early. Mass screening for early detection of cancer, however, requires sensitive and specific biomarkers obtainable from body fluids such as blood or urine. To date, most biomarker discovery programs focus on the proteome rather than the endogenous peptidome. It has been long-established that tumor cells and stromal cells produce tumor resident proteases (TRPs) to remodel the surrounding tumor microenvironment in support of tumor progression. In fact, proteolytic products of TRPs have been shown to correlate with malignant behavior. Being of low molecular weight, these unique peptides can pass through the endothelial barrier of the vasculature into the bloodstream. As such, the cancer peptidome has increasingly become a focus for biomarker discovery. In this review, we discuss on the various aspects of the peptidome in cancer biomarker research.
Collapse
Affiliation(s)
- Pey Yee Lee
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Teck Yew Low
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia.
| | - Rahman Jamal
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| |
Collapse
|