1
|
Imaduwage I, Hewadikaram M. Predicted roles of long non-coding RNAs in abiotic stress tolerance responses of plants. MOLECULAR HORTICULTURE 2024; 4:20. [PMID: 38745264 PMCID: PMC11094901 DOI: 10.1186/s43897-024-00094-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 04/06/2024] [Indexed: 05/16/2024]
Abstract
The plant genome exhibits a significant amount of transcriptional activity, with most of the resulting transcripts lacking protein-coding potential. Non-coding RNAs play a pivotal role in the development and regulatory processes in plants. Long non-coding RNAs (lncRNAs), which exceed 200 nucleotides, may play a significant role in enhancing plant resilience to various abiotic stresses, such as excessive heat, drought, cold, and salinity. In addition, the exogenous application of chemicals, such as abscisic acid and salicylic acid, can augment plant defense responses against abiotic stress. While how lncRNAs play a role in abiotic stress tolerance is relatively well-studied in model plants, this review provides a comprehensive overview of the current understanding of this function in horticultural crop plants. It also delves into the potential role of lncRNAs in chemical priming of plants in order to acquire abiotic stress tolerance, although many limitations exist in proving lncRNA functionality under such conditions.
Collapse
Affiliation(s)
- Iuh Imaduwage
- Department of Biomedical Sciences, Faculty of Science, NSBM Green University, Pitipana, Homagama, Sri Lanka
| | - Madhavi Hewadikaram
- Department of Biomedical Sciences, Faculty of Science, NSBM Green University, Pitipana, Homagama, Sri Lanka.
| |
Collapse
|
2
|
Erokhina TN, Ryazantsev DY, Zavriev SK, Morozov SY. Biological Activity of Artificial Plant Peptides Corresponding to the Translational Products of Small ORFs in Primary miRNAs and Other Long "Non-Coding" RNAs. PLANTS (BASEL, SWITZERLAND) 2024; 13:1137. [PMID: 38674546 PMCID: PMC11055055 DOI: 10.3390/plants13081137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 04/04/2024] [Accepted: 04/18/2024] [Indexed: 04/28/2024]
Abstract
Generally, lncPEPs (peptides encoded by long non-coding RNAs) have been identified in many plant species of several families and in some animal species. Importantly, molecular mechanisms of the miPEPs (peptides encoded by primary microRNAs, pri-miRNAs) are often poorly understood in different flowering plants. Requirement for the additional studies in these directions is highlighted by alternative findings concerning positive regulation of pri-miRNA/miRNA expression by synthetic miPEPs in plants. Further extensive studies are also needed to understand the full set of their roles in eukaryotic organisms. This review mainly aims to consider the available data on the regulatory functions of the synthetic miPEPs. Studies of chemically synthesized miPEPs and analyzing the fine molecular mechanisms of their functional activities are reviewed. Brief description of the studies to identify lncORFs (open reading frames of long non-coding RNAs) and the encoded protein products is also provided.
Collapse
Affiliation(s)
- T. N. Erokhina
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia (S.K.Z.)
| | - D. Y. Ryazantsev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia (S.K.Z.)
| | - S. K. Zavriev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia (S.K.Z.)
| | - S. Y. Morozov
- Biological Faculty, Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, 119991 Moscow, Russia
| |
Collapse
|
3
|
Li H, Meng J, Wang Z, Tang Y, Xia S, Wang Y, Qin Z, Luan Y. miPEPPred-FRL: A Novel Method for Predicting Plant MiRNA-Encoded Peptides Using Adaptive Feature Representation Learning. J Chem Inf Model 2024; 64:2889-2900. [PMID: 37733290 DOI: 10.1021/acs.jcim.3c01020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]
Abstract
MicroRNAs (miRNAs) are an essential type of small molecule RNAs that play significant regulatory roles in organisms. Recent studies have demonstrated that small open reading frames (sORFs) harbored in primary miRNAs (pri-miRNAs) can encode small peptides, known as miPEPs. Plant miPEPs can increase the abundance and activity of cognate miRNAs by promoting the transcription of their corresponding pri-miRNAs, thereby modulating plant traits. Biological experiments are the most effective way to accurately identify miPEPs; however, they are time-consuming and expensive. Hence, an efficient computational method for the identification of miPEPs on a large scale is highly desirable. Up to now, there have been no specialized computational tools for identifying miPEPs. In this work, a novel predictor named miPEPPred-FRL based on an adaptive feature representation learning framework that consists of the feature transformation module and the cascade architecture has been proposed. The feature transformation module integrating a newly designed feature selection method and classifier selection rule is developed to convert sequence-based features into primary class and probabilistic features, which are then fed into the improved cascade architecture to obtain more stable and discriminative augmented features. Finally, the augmented features are utilized to construct the final predictor. Cross-validation experiments illustrate that the novel feature selection method and classifier selection rule contribute to boosting the feature representation ability of the framework. Furthermore, the high accuracy of miPEPPred-FRL on independent testing data suggests that it is a trustworthy and valuable tool for the identification of miPEPs.
Collapse
Affiliation(s)
- Haibin Li
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Zhaowei Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Youwei Tang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Shihao Xia
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Yu Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Zhaojing Qin
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning 116024, China
| |
Collapse
|
4
|
Peng Z, Li J, Jiang X, Wan C. sOCP: a framework predicting smORF coding potential based on TIS and in-frame features and effectively applied in the human genome. Brief Bioinform 2024; 25:bbae147. [PMID: 38600664 PMCID: PMC11006793 DOI: 10.1093/bib/bbae147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 02/25/2024] [Accepted: 03/19/2024] [Indexed: 04/12/2024] Open
Abstract
Small open reading frames (smORFs) have been acknowledged to play various roles on essential biological pathways and affect human beings from diabetes to tumorigenesis. Predicting smORFs in silico is quite a prerequisite for processing the omics data. Here, we proposed the smORF-coding-potential-predicting framework, sOCP, which provides functions to construct a model for predicting novel smORFs in some species. The sOCP model constructed in human was based on in-frame features and the nucleotide bias around the start codon, and the small feature subset was proved to be competent enough and avoid overfitting problems for complicated models. It showed more advanced prediction metrics than previous methods and could correlate closely with experimental evidence in a heterogeneous dataset. The model was applied to Rattus norvegicus and exhibited satisfactory performance. We then scanned smORFs with ATG and non-ATG start codons from the human genome and generated a database containing about a million novel smORFs with coding potential. Around 72 000 smORFs are located on the lncRNA regions of the genome. The smORF-encoded peptides may be involved in biological pathways rare for canonical proteins, including glucocorticoid catabolic process and the prokaryotic defense system. Our work provides a model and database for human smORF investigation and a convenient tool for further smORF prediction in other species.
Collapse
Affiliation(s)
- Zhao Peng
- School of Life Sciences, and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Jiaqiang Li
- School of Computer Science, and Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Xingpeng Jiang
- School of Computer Science, and Hubei Provincial Key Laboratory of Artificial Intelligence and Smart Learning, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| | - Cuihong Wan
- School of Life Sciences, and Hubei Key Laboratory of Genetic Regulation and Integrative Biology, Central China Normal University, Wuhan 430079, Hubei, People’s Republic of China
| |
Collapse
|
5
|
Wang J, Li Y, Li M, Zhang W, Lu Y, Hua K, Ling X, Chen T, Guo D, Yang Y, Zheng Z, Liu Q, Zhang B. Translatome and Transcriptome Analyses Reveal the Mechanism that Underlies the Enhancement of Salt Stress by the Small Peptide Ospep5 in Plants. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:4277-4291. [PMID: 38288993 DOI: 10.1021/acs.jafc.3c08528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
Salt stress significantly impedes plant growth and the crop yield. This study utilized de novo transcriptome assembly and ribosome profiling to explore mRNA translation's role in rice salt tolerance. We identified unrecognized translated open reading frames (ORFs), including 42 upstream transcripts and 86 unannotated transcripts. A noteworthy discovery was the role of a small ORF, Ospep5, in conferring salt tolerance. Overexpression of Ospep5 in plants increased salt tolerance, while its absence led to heightened sensitivity. This hypothesis was corroborated by the findings that exogenous application of the synthetic small peptide Ospep5 bolstered salt tolerance in both rice and Arabidopsis. We found that the mechanism underpinning the Ospep5-mediated salt tolerance involves the maintenance of intracellular Na+/K+ homeostasis, facilitated by upregulation of high-affinity potassium transporters (HKT) and Na+/H+ exchangers (SOS1). Furthermore, a comprehensive multiomics approach, particularly ribosome profiling, is instrumental in uncovering unannotated ORFs and elucidating their functions in plant stress responses.
Collapse
Affiliation(s)
- Jinyan Wang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Yang Li
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Mingyue Li
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Wenting Zhang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Yaping Lu
- Experimental center of Life Sciences, College of Life Sciences, Nanjing Agricultural University, Nanjing 210095, China
| | - Kai Hua
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Botanical Garden, Shanghai 201602, China
| | - Xitie Ling
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
| | - Tianzi Chen
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Dongshu Guo
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Yuwen Yang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
| | - Zhongbing Zheng
- School of Breeding and Multiplication (Sanya Institute of Breeding and Multiplication), Hainan University, Sanya 572025, China
| | - Qing Liu
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
- College of Agriculture, Yangzhou University, Yangzhou 225009, China
| | - Baolong Zhang
- Provincial Key Laboratory of Agrobiology and Institute of Germplasm Resources and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, China
- Zhongshan Biological Breeding Laboratory, Nanjing 210014, China
- School of Breeding and Multiplication (Sanya Institute of Breeding and Multiplication), Hainan University, Sanya 572025, China
| |
Collapse
|
6
|
Wu HYL, Ai Q, Teixeira RT, Nguyen PHT, Song G, Montes C, Elmore JM, Walley JW, Hsu PY. Improved super-resolution ribosome profiling reveals prevalent translation of upstream ORFs and small ORFs in Arabidopsis. THE PLANT CELL 2024; 36:510-539. [PMID: 38000896 PMCID: PMC10896292 DOI: 10.1093/plcell/koad290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/13/2023] [Accepted: 10/17/2023] [Indexed: 11/26/2023]
Abstract
A crucial step in functional genomics is identifying actively translated ORFs and linking them to biological functions. The challenge lies in identifying short ORFs, as their identification is greatly influenced by data quality and depth. Here, we improved the coverage of super-resolution Ribo-seq in Arabidopsis (Arabidopsis thaliana), revealing uncharacterized translation events for nuclear, chloroplastic, and mitochondrial genes. Assisted by a transcriptome assembly, we identified 7,751 unconventional translation events, comprising 6,996 upstream ORFs (uORFs) and 209 downstream ORFs on annotated protein-coding genes, as well as 546 ORFs in presumed noncoding RNAs. Proteomic data confirmed the production of stable proteins from some of these unannotated translation events. We present evidence of active translation from primary transcripts of trans-acting small interfering RNAs (TAS1-4) and microRNAs (pri-MIR163 and pri-MIR169) and periodic ribosome stalling supporting cotranslational decay. Additionally, we developed a method for identifying extremely short uORFs, including 370 minimum uORFs (AUG-stop), and 2,921 tiny uORFs (2 to 10 amino acids) and 681 uORFs that overlap with each other. Remarkably, these short uORFs exhibit strong translational repression as do longer uORFs. We also systematically discovered 594 uORFs regulated by alternative splicing, suggesting widespread isoform-specific translational control. Finally, these prevalent uORFs are associated with numerous important pathways. In summary, our improved Arabidopsis translational landscape provides valuable resources to study gene expression regulation.
Collapse
Affiliation(s)
- Hsin-Yen Larry Wu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Qiaoyun Ai
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Rita Teresa Teixeira
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Phong H T Nguyen
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| | - Gaoyuan Song
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Christian Montes
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - J Mitch Elmore
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Justin W Walley
- Department of Plant Pathology, Entomology, and Microbiology, Iowa State University, Ames, IA 50011, USA
| | - Polly Yingshan Hsu
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
7
|
Fesenko I, Sahakyan H, Shabalina SA, Koonin EV. The Cryptic Bacterial Microproteome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.17.580829. [PMID: 38903115 PMCID: PMC11188072 DOI: 10.1101/2024.02.17.580829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/22/2024]
Abstract
Microproteins encoded by small open reading frames (smORFs) comprise the "dark matter" of proteomes. Although functional microproteins were identified in diverse organisms from all three domains of life, bacterial smORFs remain poorly characterized. In this comprehensive study of intergenic smORFs (ismORFs, 15-70 codons) in 5,668 bacterial genomes of the family Enterobacteriaceae, we identified 67,297 clusters of ismORFs subject to purifying selection. The ismORFs mainly code for hydrophobic, potentially transmembrane, unstructured, or minimally structured microproteins. Using AlphaFold Multimer, we predicted interactions of some of the predicted microproteins encoded by transcribed ismORFs with proteins encoded by neighboring genes, revealing the potential of microproteins to regulate the activity of various proteins, particularly, under stress. We compiled a catalog of predicted microprotein families with different levels of evidence from synteny analysis, structure prediction, and transcription and translation data. This study offers a resource for investigation of biological functions of microproteins.
Collapse
Affiliation(s)
- Igor Fesenko
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Harutyun Sahakyan
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Svetlana A. Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V. Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
8
|
Schrader M. Origins, Technological Advancement, and Applications of Peptidomics. Methods Mol Biol 2024; 2758:3-47. [PMID: 38549006 DOI: 10.1007/978-1-0716-3646-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2024]
Abstract
Peptidomics is the comprehensive characterization of peptides from biological sources instead of heading for a few single peptides in former peptide research. Mass spectrometry allows to detect a multitude of peptides in complex mixtures and thus enables new strategies leading to peptidomics. The term was established in the year 2001, and up to now, this new field has grown to over 3000 publications. Analytical techniques originally developed for fast and comprehensive analysis of peptides in proteomics were specifically adjusted for peptidomics. Although it is thus closely linked to proteomics, there are fundamental differences with conventional bottom-up proteomics. Fundamental technological advancements of peptidomics since have occurred in mass spectrometry and data processing, including quantification, and more slightly in separation technology. Different strategies and diverse sources of peptidomes are mentioned by numerous applications, such as discovery of neuropeptides and other bioactive peptides, including the use of biochemical assays. Furthermore, food and plant peptidomics are introduced similarly. Additionally, applications with a clinical focus are included, comprising biomarker discovery as well as immunopeptidomics. This overview extensively reviews recent methods, strategies, and applications including links to all other chapters of this book.
Collapse
Affiliation(s)
- Michael Schrader
- Department of Bioengineering Sciences, Weihenstephan-Tr. University of Applied Sciences, Freising, Germany.
| |
Collapse
|
9
|
Zhang L, Lin T, Zhu G, Wu B, Zhang C, Zhu H. LncRNAs exert indispensable roles in orchestrating the interaction among diverse noncoding RNAs and enrich the regulatory network of plant growth and its adaptive environmental stress response. HORTICULTURE RESEARCH 2023; 10:uhad234. [PMID: 38156284 PMCID: PMC10753412 DOI: 10.1093/hr/uhad234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 11/01/2023] [Indexed: 12/30/2023]
Abstract
With the advent of advanced sequencing technologies, non-coding RNAs (ncRNAs) are increasingly pivotal and play highly regulated roles in the modulation of diverse aspects of plant growth and stress response. This includes a spectrum of ncRNA classes, ranging from small RNAs to long non-coding RNAs (lncRNAs). Notably, among these, lncRNAs emerge as significant and intricate components within the broader ncRNA regulatory networks. Here, we categorize ncRNAs based on their length and structure into small RNAs, medium-sized ncRNAs, lncRNAs, and circle RNAs. Furthermore, the review delves into the detailed biosynthesis and origin of these ncRNAs. Subsequently, we emphasize the diverse regulatory mechanisms employed by lncRNAs that are located at various gene regions of coding genes, embodying promoters, 5'UTRs, introns, exons, and 3'UTR regions. Furthermore, we elucidate these regulatory modes through one or two concrete examples. Besides, lncRNAs have emerged as novel central components that participate in phase separation processes. Moreover, we illustrate the coordinated regulatory mechanisms among lncRNAs, miRNAs, and siRNAs with a particular emphasis on the central role of lncRNAs in serving as sponges, precursors, spliceosome, stabilization, scaffolds, or interaction factors to bridge interactions with other ncRNAs. The review also sheds light on the intriguing possibility that some ncRNAs may encode functional micropeptides. Therefore, the review underscores the emergent roles of ncRNAs as potent regulatory factors that significantly enrich the regulatory network governing plant growth, development, and responses to environmental stimuli. There are yet-to-be-discovered roles of ncRNAs waiting for us to explore.
Collapse
Affiliation(s)
- Lingling Zhang
- College of Food Science and Nutritional Engineering, China Agricultural University, Beijing, 100083, China
| | - Tao Lin
- College of Horticulture, China Agricultural University, Beijing 100193, China
| | - Guoning Zhu
- College of Food Science and Nutritional Engineering, China Agricultural University, Beijing, 100083, China
| | - Bin Wu
- Institute of Agro-products Storage and Processing, Xinjiang Academy of Agricultural Science, Urumqi, Xinjiang 830091, China
| | - Chunjiao Zhang
- Supervision, Inspection & Testing Center of Agricultural Products Quality, Ministry of Agriculture and Rural Affairs, Beijing 100083, China
| | - Hongliang Zhu
- College of Food Science and Nutritional Engineering, China Agricultural University, Beijing, 100083, China
| |
Collapse
|
10
|
Feng YZ, Zhu QF, Xue J, Chen P, Yu Y. Shining in the dark: the big world of small peptides in plants. ABIOTECH 2023; 4:238-256. [PMID: 37970469 PMCID: PMC10638237 DOI: 10.1007/s42994-023-00100-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 02/24/2023] [Indexed: 11/17/2023]
Abstract
Small peptides represent a subset of dark matter in plant proteomes. Through differential expression patterns and modes of action, small peptides act as important regulators of plant growth and development. Over the past 20 years, many small peptides have been identified due to technical advances in genome sequencing, bioinformatics, and chemical biology. In this article, we summarize the classification of plant small peptides and experimental strategies used to identify them as well as their potential use in agronomic breeding. We review the biological functions and molecular mechanisms of small peptides in plants, discuss current problems in small peptide research and highlight future research directions in this field. Our review provides crucial insight into small peptides in plants and will contribute to a better understanding of their potential roles in biotechnology and agriculture.
Collapse
Affiliation(s)
- Yan-Zhao Feng
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Qing-Feng Zhu
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Jiao Xue
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Pei Chen
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| | - Yang Yu
- Guangdong Key Laboratory of Crop Germplasm Resources Preservation and Utilization, Key Laboratory of South China Modern Biological Seed Industry, Ministry of Agriculture and Rural Affairs, Agro-Biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, 510640 China
| |
Collapse
|
11
|
Jaiswal M, Kumar S. smAMPsTK: a toolkit to unravel the smORFome encoding AMPs of plant species. J Biomol Struct Dyn 2023:1-13. [PMID: 37464885 DOI: 10.1080/07391102.2023.2235605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 07/06/2023] [Indexed: 07/20/2023]
Abstract
The pervasive repertoire of plant molecules with the potential to serve as a substitute for conventional antibiotics has led to obtaining better insights into plant-derived antimicrobial peptides (AMPs). The massive distribution of Small Open Reading Frames (smORFs) throughout eukaryotic genomes with proven extensive biological functions reflects their practicality as antimicrobials. Here, we have developed a pipeline named smAMPsTK to unveil the underlying hidden smORFs encoding AMPs for plant species. By applying this pipeline, we have elicited AMPs of various functional activity of lengths ranging from 5 to 100 aa by employing publicly available transcriptome data of five different angiosperms. Later, we studied the coding potential of AMPs-smORFs, the inclusion of diverse translation initiation start codons, and amino acid frequency. Codon usage study signifies no such codon usage biases for smORFs encoding AMPs. Majorly three start codons are prominent in generating AMPs. The evolutionary and conservational study proclaimed the widespread distribution of AMPs encoding genes throughout the plant kingdom. Domain analysis revealed that nearly all AMPs have chitin-binding ability, establishing their role as antifungal agents. The current study includes a developed methodology to characterize smORFs encoding AMPs, and their implications as antimicrobial, antibacterial, antifungal, or antiviral provided by SVM score and prediction status calculated by machine learning-based prediction models. The pipeline, complete package, and the results derived for five angiosperms are freely available at https://github.com/skbinfo/smAMPsTK.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Mohini Jaiswal
- Bioinformatics Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, India
| | - Shailesh Kumar
- Bioinformatics Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, India
| |
Collapse
|
12
|
Yang J, Yue HR, Pan LY, Feng JX, Zhao S, Suwannarangsee S, Chempreda V, Liu CG, Zhao XQ. Fungal strain improvement for efficient cellulase production and lignocellulosic biorefinery: Current status and future prospects. BIORESOURCE TECHNOLOGY 2023:129449. [PMID: 37406833 DOI: 10.1016/j.biortech.2023.129449] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 06/29/2023] [Accepted: 07/01/2023] [Indexed: 07/07/2023]
Abstract
Lignocellulosic biomass (LCB) has been recognized as a valuable carbon source for the sustainable production of biofuels and value-added biochemicals. Crude enzymes produced by fungal cell factories benefit economic LCB degradation. However, high enzyme production cost remains a great challenge. Filamentous fungi have been widely used to produce cellulolytic enzymes. Metabolic engineering of fungi contributes to efficient cellulase production for LCB biorefinery. Here the latest progress in utilizing fungal cell factories for cellulase production was summarized, including developing genome engineering tools to improve the efficiency of fungal cell factories, manipulating promoters, and modulating transcription factors. Multi-omics analysis of fungi contributes to identifying novel genetic elements for enhancing cellulase production. Furthermore, the importance of translation regulation of cellulase production are emphasized. Efficient development of fungal cell factories based on integrative strain engineering would benefit the overall bioconversion efficacy of LCB for sustainable bioproduction.
Collapse
Affiliation(s)
- Jie Yang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Hou-Ru Yue
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Li-Ya Pan
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi Research Center for Microbial and Enzyme Engineering Technology, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| | - Jia-Xun Feng
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi Research Center for Microbial and Enzyme Engineering Technology, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| | - Shuai Zhao
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi Research Center for Microbial and Enzyme Engineering Technology, College of Life Science and Technology, Guangxi University, Nanning 530004, China
| | - Surisa Suwannarangsee
- National Center for Genetic Engineering and Biotechnology (BIOTEC), 113 Thailand Science Park, Phaholyothin Road, Khlong Luang, Pathumthani 12120, Thailand
| | - Verawat Chempreda
- National Center for Genetic Engineering and Biotechnology (BIOTEC), 113 Thailand Science Park, Phaholyothin Road, Khlong Luang, Pathumthani 12120, Thailand
| | - Chen-Guang Liu
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Xin-Qing Zhao
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China.
| |
Collapse
|
13
|
Dong X, Zhang K, Xun C, Chu T, Liang S, Zeng Y, Liu Z. Small Open Reading Frame-Encoded Micro-Peptides: An Emerging Protein World. Int J Mol Sci 2023; 24:10562. [PMID: 37445739 DOI: 10.3390/ijms241310562] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Revised: 06/20/2023] [Accepted: 06/21/2023] [Indexed: 07/15/2023] Open
Abstract
Small open reading frames (sORFs) are often overlooked features in genomes. In the past, they were labeled as noncoding or "transcriptional noise". However, accumulating evidence from recent years suggests that sORFs may be transcribed and translated to produce sORF-encoded polypeptides (SEPs) with less than 100 amino acids. The vigorous development of computational algorithms, ribosome profiling, and peptidome has facilitated the prediction and identification of many new SEPs. These SEPs were revealed to be involved in a wide range of basic biological processes, such as gene expression regulation, embryonic development, cellular metabolism, inflammation, and even carcinogenesis. To effectively understand the potential biological functions of SEPs, we discuss the history and development of the newly emerging research on sORFs and SEPs. In particular, we review a range of recently discovered bioinformatics tools for identifying, predicting, and validating SEPs as well as a variety of biochemical experiments for characterizing SEP functions. Lastly, this review underlines the challenges and future directions in identifying and validating sORFs and their encoded micropeptides, providing a significant reference for upcoming research on sORF-encoded peptides.
Collapse
Affiliation(s)
- Xiaoping Dong
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Kun Zhang
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Chengfeng Xun
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Tianqi Chu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Songping Liang
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| | - Yong Zeng
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
- The State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha 410081, China
| | - Zhonghua Liu
- National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, China
- Peptide and Small Molecule Drug R&D Platform, Furong Laboratory, Hunan Normal University, Changsha 410081, China
| |
Collapse
|
14
|
Wang Z, Cui Q, Su C, Zhao S, Wang R, Wang Z, Meng J, Luan Y. Unveiling the secrets of non-coding RNA-encoded peptides in plants: A comprehensive review of mining methods and research progress. Int J Biol Macromol 2023:124952. [PMID: 37257526 DOI: 10.1016/j.ijbiomac.2023.124952] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 05/15/2023] [Accepted: 05/16/2023] [Indexed: 06/02/2023]
Abstract
Non-coding RNAs (ncRNAs) are not conventionally involved in protein encoding. However, recent findings indicate that ncRNAs possess the capacity to code for proteins or peptides. These ncRNA-encoded peptides (ncPEPs) are vital for diverse plant life processes and exhibit significant potential value. Despite their importance, research on plant ncPEPs is limited, with only a few studies conducted and less information on the underlying mechanisms, and the field remains in its nascent stage. This manuscript provides a comprehensive overview of ncPEPs mining methods in plants, focusing on prediction, identification, and functional analysis. We discuss the strengths and weaknesses of various techniques, identify future research directions in the ncPEPs domain, and elucidate the biological functions and agricultural application prospects of plant ncPEPs. By highlighting the immense potential and research value of ncPEPs, we aim to lay a solid foundation for more in-depth studies in plant science.
Collapse
Affiliation(s)
- Zhengjie Wang
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China
| | - Qi Cui
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China
| | - Chenglin Su
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China
| | - Siyuan Zhao
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Ruiming Wang
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China
| | - Zhicheng Wang
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, China
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian 116024, China.
| |
Collapse
|
15
|
Zhao S, Meng J, Wekesa JS, Luan Y. Identification of small open reading frames in plant lncRNA using class-imbalance learning. Comput Biol Med 2023; 157:106773. [PMID: 36924731 DOI: 10.1016/j.compbiomed.2023.106773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 02/21/2023] [Accepted: 03/09/2023] [Indexed: 03/12/2023]
Abstract
Recently, small open reading frames (sORFs) in long noncoding RNA (lncRNA) have been demonstrated to encode small peptides that can help study the mechanisms of growth and development in organisms. Since machine learning-based computational methods are less costly compared with biological experiments, they can be used to identify sORFs and provide a basis for biological experiments. However, few computational methods and data resources have been exploited for identifying sORFs in plant lncRNA. Besides, machine learning models produce underperforming classifiers when faced with a class-imbalance problem. In this study, an alternative method called SMOTE based on weighted cosine distance (WCDSMOTE) which enables interaction with feature selection is put forward to synthesize minority class samples and weighted edited nearest neighbor (WENN) is applied to clean up majority class samples, thus, hybrid sampling WCDSMOTE-ENN is proposed to deal with imbalanced datasets with the multi-angle feature. A heterogeneous classifier ensemble is introduced to complete the classification task. Therefore, a novel computational method that is based on class-imbalance learning to identify the sORFs with coding potential in plant lncRNA (sORFplnc) is presented. Experimental results manifest that sORFplnc outperforms existing computational methods in identifying sORFs with coding potential. We anticipate that the proposed work can be a reference for relevant research and contribute to agriculture and biomedicine.
Collapse
Affiliation(s)
- Siyuan Zhao
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning, 116024, China.
| | - Jael Sanyanda Wekesa
- Department of Information Technology, Jomo Kenyatta University of Agriculture and Technology, Nairobi, 62000-00200, Kenya
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning, 116024, China
| |
Collapse
|
16
|
Yu J, Jiang W, Zhu SB, Liao Z, Dou X, Liu J, Guo FB, Dong C. Prediction of protein-coding small ORFs in multi-species using integrated sequence-derived features and the random forest model. Methods 2023; 210:10-19. [PMID: 36621557 DOI: 10.1016/j.ymeth.2022.12.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Revised: 12/27/2022] [Accepted: 12/30/2022] [Indexed: 01/07/2023] Open
Abstract
Proteins encoded by small open reading frames (sORFs) can serve as functional elements playing important roles in vivo. Such sORFs also constitute the potential pool for facilitating the de novo gene birth, driving evolutionary innovation and species diversity. Therefore, their theoretical and experimental identification has become a critical issue. Herein, we proposed a protein-coding sORFs prediction method merely based on integrative sequence-derived features. Our prediction performance is better or comparable compared with other nine prevalent methods, which shows that our method can provide a relatively reliable research tool for the prediction of protein-coding sORFs. Our method allows users to estimate the potential expression of a queried sORF, which has been demonstrated by the correlation analysis between our possibility estimation and codon adaption index (CAI). Based on the features that we used, we demonstrated that the sequence features of the protein-coding sORFs in the two domains have significant differences implying that it might be a relatively hard task in terms of cross-domain prediction, hence domain-specific models were developed, which allowed users to predict protein-coding sORFs both in eukaryotes and prokaryotes. Finally, a web-server was developed and provided to boost and facilitate the study of the related field, which is freely available at http://guolab.whu.edu.cn/codingCapacity/index.html.
Collapse
Affiliation(s)
- Jiafeng Yu
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou 253023, China
| | - Wenwen Jiang
- Department of Bioinformatics, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
| | - Sen-Bin Zhu
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Zhen Liao
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu 611731, China
| | - Xianghua Dou
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou 253023, China
| | - Jian Liu
- Shandong Key Laboratory of Biophysics, Institute of Biophysics, Dezhou University, Dezhou 253023, China
| | - Feng-Biao Guo
- School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, China.
| | - Chuan Dong
- School of Pharmaceutical Sciences, Wuhan University, Wuhan 430071, China.
| |
Collapse
|
17
|
Mamaeva A, Lyapina I, Knyazev A, Golub N, Mollaev T, Chudinova E, Elansky S, Babenko VV, Veselovsky VA, Klimina KM, Gribova T, Kharlampieva D, Lazarev V, Fesenko I. RALF peptides modulate immune response in the moss Physcomitrium patens. FRONTIERS IN PLANT SCIENCE 2023; 14:1077301. [PMID: 36818838 PMCID: PMC9933782 DOI: 10.3389/fpls.2023.1077301] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Accepted: 01/13/2023] [Indexed: 06/18/2023]
Abstract
BACKGROUND RAPID ALKALINIZATION FACTOR (RALFs) are cysteine-rich peptides that regulate multiple physiological processes in plants. This peptide family has considerably expanded during land plant evolution, but the role of ancient RALFs in modulating stress responses is unknown.Results: Here, we used the moss Physcomitrium patens as a model to gain insight into the role of RALF peptides in the coordination of plant growth and stress response in non-vascular plants. The quantitative proteomic analysis revealed concerted downregulation of M6 metalloprotease and some membrane proteins, including those involved in stress response, in PpRALF1, 2 and 3 knockout (KO) lines. The subsequent analysis revealed the role of PpRALF3 in growth regulation under abiotic and biotic stress conditions, implying the importance of RALFs in responding to various adverse conditions in bryophytes. We found that knockout of the PpRALF2 and PpRALF3 genes resulted in increased resistance to bacterial and fungal phytopathogens, Pectobacterium carotovorum and Fusarium solani, suggesting the role of these peptides in negative regulation of the immune response in P. patens. Comparing the transcriptomes of PpRALF3 KO and wild-type plants infected by F. solani showed that the regulation of genes in the phenylpropanoid pathway and those involved in cell wall modification and biogenesis was different in these two genotypes. CONCLUSION Thus, our study sheds light on the function of the previously uncharacterized PpRALF3 peptide and gives a clue to the ancestral functions of RALF peptides in plant stress response.
Collapse
Affiliation(s)
- Anna Mamaeva
- Laboratory of System Analysis of Proteins and Peptides, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Irina Lyapina
- Laboratory of System Analysis of Proteins and Peptides, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Andrey Knyazev
- Laboratory of System Analysis of Proteins and Peptides, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Nina Golub
- Laboratory of System Analysis of Proteins and Peptides, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Timur Mollaev
- Agrarian and Technological Institute, Peoples Friendship University of Russia (RUDN University), Moscow, Russia
| | - Elena Chudinova
- Agrarian and Technological Institute, Peoples Friendship University of Russia (RUDN University), Moscow, Russia
| | - Sergey Elansky
- Agrarian and Technological Institute, Peoples Friendship University of Russia (RUDN University), Moscow, Russia
- Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia
| | - Vladislav V. Babenko
- Laboratory of Genetic Engineering, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Vladimir A. Veselovsky
- Laboratory of Genetic Engineering, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Ksenia M. Klimina
- Laboratory of Genetic Engineering, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Tatiana Gribova
- Laboratory of Genetic Engineering, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Daria Kharlampieva
- Laboratory of Genetic Engineering, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
| | - Vassili Lazarev
- Laboratory of Genetic Engineering, Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow, Russia
- Department of Molecular and Translational Medicine, Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Moscow, Russia
| | - Igor Fesenko
- Laboratory of System Analysis of Proteins and Peptides, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
18
|
Kobayashi H, Murakami K, Sugano SS, Tamura K, Oka Y, Matsushita T, Shimada T. Comprehensive analysis of peptide-coding genes and initial characterization of an LRR-only microprotein in Marchantia polymorpha. FRONTIERS IN PLANT SCIENCE 2023; 13:1051017. [PMID: 36756228 PMCID: PMC9901580 DOI: 10.3389/fpls.2022.1051017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 12/28/2022] [Indexed: 06/18/2023]
Abstract
In the past two decades, many plant peptides have been found to play crucial roles in various biological events by mediating cell-to-cell communications. However, a large number of small open reading frames (sORFs) or short genes capable of encoding peptides remain uncharacterized. In this study, we examined several candidate genes for peptides conserved between two model plants: Arabidopsis thaliana and Marchantia polymorpha. We examined their expression pattern in M. polymorpha and subcellular localization using a transient assay with Nicotiana benthamiana. We found that one candidate, MpSGF10B, was expressed in meristems, gemma cups, and male reproductive organs called antheridiophores. MpSGF10B has an N-terminal signal peptide followed by two leucine-rich repeat (LRR) domains and was secreted to the extracellular region in N. benthamiana and M. polymorpha. Compared with the wild type, two independent Mpsgf10b mutants had a slightly increased number of antheridiophores. It was revealed in gene ontology enrichment analysis that MpSGF10B was significantly co-expressed with genes related to cell cycle and development. These results suggest that MpSGF10B may be involved in the reproductive development of M. polymorpha. Our research should shed light on the unknown role of LRR-only proteins in land plants.
Collapse
Affiliation(s)
| | | | - Shigeo S. Sugano
- Bioproduction Research Institute, The National Institute of Advanced Industrial Science and Technology, Tsukuba, Ibaraki, Japan
| | - Kentaro Tamura
- Department of Environmental and Life Sciences, University of Shizuoka, Shizuoka-shi, Shizuoka, Japan
| | - Yoshito Oka
- Graduate School of Science, Kyoto University, Kyoto, Japan
| | | | - Tomoo Shimada
- Graduate School of Science, Kyoto University, Kyoto, Japan
| |
Collapse
|
19
|
Complementary peptides represent a credible alternative to agrochemicals by activating translation of targeted proteins. Nat Commun 2023; 14:254. [PMID: 36650156 PMCID: PMC9845214 DOI: 10.1038/s41467-023-35951-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 01/10/2023] [Indexed: 01/18/2023] Open
Abstract
The current agriculture main challenge is to maintain food production while facing multiple threats such as increasing world population, temperature increase, lack of agrochemicals due to health issues and uprising of weeds resistant to herbicides. Developing novel, alternative, and safe methods is hence of paramount importance. Here, we show that complementary peptides (cPEPs) from any gene can be designed to target specifically plant coding genes. External application of synthetic peptides increases the abundance of the targeted protein, leading to related phenotypes. Moreover, we provide evidence that cPEPs can be powerful tools in agronomy to improve plant traits, such as growth, resistance to pathogen or heat stress, without the needs of genetic approaches. Finally, by combining their activity they can also be used to reduce weed growth.
Collapse
|
20
|
Álvarez-Urdiola R, Borràs E, Valverde F, Matus JT, Sabidó E, Riechmann JL. Peptidomics Methods Applied to the Study of Flower Development. Methods Mol Biol 2023; 2686:509-536. [PMID: 37540375 DOI: 10.1007/978-1-0716-3299-4_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2023]
Abstract
Understanding the global and dynamic nature of plant developmental processes requires not only the study of the transcriptome, but also of the proteome, including its largely uncharacterized peptidome fraction. Recent advances in proteomics and high-throughput analyses of translating RNAs (ribosome profiling) have begun to address this issue, evidencing the existence of novel, uncharacterized, and possibly functional peptides. To validate the accumulation in tissues of sORF-encoded polypeptides (SEPs), the basic setup of proteomic analyses (i.e., LC-MS/MS) can be followed. However, the detection of peptides that are small (up to ~100 aa, 6-7 kDa) and novel (i.e., not annotated in reference databases) presents specific challenges that need to be addressed both experimentally and with computational biology resources. Several methods have been developed in recent years to isolate and identify peptides from plant tissues. In this chapter, we outline two different peptide extraction protocols and the subsequent peptide identification by mass spectrometry using the database search or the de novo identification methods.
Collapse
Affiliation(s)
- Raquel Álvarez-Urdiola
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
| | - Eva Borràs
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Federico Valverde
- Institute for Plant Biochemistry and Photosynthesis CSIC - University of Seville, Seville, Spain
| | - José Tomás Matus
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain
- Institute for Integrative Systems Biology (I2SysBio), Universitat de València-CSIC, Paterna, Valencia, Spain
| | - Eduard Sabidó
- Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - José Luis Riechmann
- Centre for Research in Agricultural Genomics (CRAG) CSIC-IRTA-UAB-UB, Edifici CRAG, Campus UAB, Cerdanyola del Vallès, Barcelona, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain.
| |
Collapse
|
21
|
Çakır U, Gabed N, Brunet M, Roucou X, Kryvoruchko I. Mosaic translation hypothesis: chimeric polypeptides produced via multiple ribosomal frameshifting as a basis for adaptability. FEBS J 2023; 290:370-378. [PMID: 34743413 DOI: 10.1111/febs.16269] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 10/03/2021] [Accepted: 11/05/2021] [Indexed: 02/05/2023]
Abstract
How many different proteins can be produced from a single spliced transcript? Genome annotation projects overlook the coding potential of reading frames other than that of the reference open reading frames (refORFs). Recently, alternative open reading frames (altORFs) and their translational products, alternative proteins, have been shown to carry out important functions in various organisms. AltORFs overlapping refORFs or other altORFs in a different reading frame may be involved in one fundamental mechanism so far overlooked. A few years ago, it was proposed that altORFs may act as building blocks for chimeric (mosaic) polypeptides, which are produced via multiple ribosomal frameshifting events from a single mature transcript. We adopt terminology from that earlier discussion and call this mechanism mosaic translation. This way of extracting and combining genetic information may significantly increase proteome diversity. Thus, we hypothesize that this mechanism may have contributed to the flexibility and adaptability of organisms to a variety of environmental conditions. Specialized ribosomes acting as sensors probably played a central role in this process. Importantly, mosaic translation may be the main source of protein diversity in genomes that lack alternative splicing. The idea of mosaic translation is a testable hypothesis, although its direct demonstration is challenging. Should mosaic translation occur, we would currently highly underestimate the complexity of translation mechanisms and thus the proteome.
Collapse
Affiliation(s)
- Umut Çakır
- Molecular Biology and Genetics Department, Faculty of Arts and Sciences, Boğaziçi University, Istanbul, Turkey
| | - Noujoud Gabed
- Cellular and Molecular Biology Department, Oran High School of Biological Sciences (ESSBO), Oran, Algeria
| | - Marie Brunet
- Department of Pediatrics, Medical Genetics Service, Université de Sherbrooke, QC, Canada.,Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), QC, Canada
| | - Xavier Roucou
- Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), QC, Canada.,Department of Biochemistry and Functional Genomics, Université de Sherbrooke, QC, Canada
| | - Igor Kryvoruchko
- Molecular Biology and Genetics Department, Faculty of Arts and Sciences, Boğaziçi University, Istanbul, Turkey
| |
Collapse
|
22
|
Zhao S, Meng J, Kang Q, Luan Y. Identifying LncRNA-Encoded Short Peptides Using Optimized Hybrid Features and Ensemble Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2873-2881. [PMID: 34383651 DOI: 10.1109/tcbb.2021.3104288] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Long non-coding RNA (lncRNA) contains short open reading frames (sORFs), and sORFs-encoded short peptides (SEPs) have become the focus of scientific studies due to their crucial role in life activities. The identification of SEPs is vital to further understanding their regulatory function. Bioinformatics methods can quickly identify SEPs to provide credible candidate sequences for verifying SEPs by biological experimenrts. However, there is a lack of methods for identifying SEPs directly. In this study, a machine learning method to identify SEPs of plant lncRNA (ISPL) is proposed. Hybrid features including sequence features and physicochemical features are extracted manually or adaptively to construct different modal features. In order to keep the stability of feature selection, the non-linear correction applied in Max-Relevance-Max-Distance (nocRD) feature selection method is proposed, which integrates multiple feature ranking results and uses the iterative random forest for different modal features dimensionality reduction. Classification models with different modal features are constructed, and their outputs are combined for ensemble classification. The experimental results show that the accuracy of ISPL is 89.86% percent on the independent test set, which will have important implications for further studies of functional genomic.
Collapse
|
23
|
Identification and analysis of smORFs in Chlamydomonas reinhardtii. Genomics 2022; 114:110444. [PMID: 35933072 DOI: 10.1016/j.ygeno.2022.110444] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 07/06/2022] [Accepted: 07/31/2022] [Indexed: 11/24/2022]
Abstract
Small open reading frames (smORFs) have been acknowledged as an important partner in organism functions ranging from bacteria to higher eukaryotes. However, lack of investigation of smORFs in green algae, despite their importance in ecology and evolution. We applied bioinformatic analysis, ribosome profiling, and small peptide proteomics to provide a genome-wide and high-confident smORF database in the model green alga Chlamydomonas reinhardtii. The whole genome was screened first to mine potential coding smORFs. Then conservative analysis, ribosome profiling, and proteomics data were processed to identify conserved smORFs and generate translation evidence. The combination of procedures resulted in 2014 smORFs that might exist in the C. reinhardtii genome. The expression of smORFs in Cd treatment suggested that two smORFs might participate in redox reaction, three in inorganic phosphate transport, and one in DNA repair under stress. Our study built a genome-widely database in C. reinhardtii, providing target smORFs for further research.
Collapse
|
24
|
Pei MS, Liu HN, Wei TL, Yu YH, Guo DL. Large-scale discovery of non-conventional peptides in grape ( Vitis vinifera L.) through peptidogenomics. HORTICULTURE RESEARCH 2022; 9:uhac023. [PMID: 35531313 PMCID: PMC9070638 DOI: 10.1093/hr/uhac023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 01/24/2022] [Indexed: 06/14/2023]
Abstract
Non-conventional peptides (NCPs), which are peptides derived from previously unannotated coding sequences, play important biological roles in plants. In this study, we used peptidogenomic methods that integrated mass spectrometry (MS) peptidomics and a six-frame translation database to extensively identify NCPs in grape. In total, 188 and 2021 non-redundant peptides from the Arabidopsis thaliana and Vitis vinifera L. protein database at Ensembl/URGI and an individualized peptidogenomic database were identified. Unlike conventional peptides, these NCPs derived mainly from intergenic, intronic, upstream ORF, 5'UTR, 3'UTR, and downstream ORF regions. These results show that unannotated regions are translated more broadly than we thought. We also found that most NCPs were derived from regions related to phenotypic variations, LTR retrotransposons, and domestication selection, indicating that the NCPs have an important function in complex biological processes. We also found that the NCPs were developmentally specific and had transient and specific functions in grape berry development. In summary, our study is the first to extensively identify NCPs in grape. It demonstrated that there was a large amount of translation in the genome. These results lay a foundation for studying the functions of NCPs and also provide a reference for the discovery of new functional genes in grape.
Collapse
Affiliation(s)
- Mao-Song Pei
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
| | - Hai-Nan Liu
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
| | - Tong-Lu Wei
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
| | - Yi-He Yu
- College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023, Henan Province, China
- Henan Engineering Technology Research Center of Quality Regulation and Controlling of Horticultural Plants, Luoyang 471023, China
| | | |
Collapse
|
25
|
Small open reading frames in plant research: from prediction to functional characterization. 3 Biotech 2022; 12:76. [PMID: 35251879 PMCID: PMC8873315 DOI: 10.1007/s13205-022-03147-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 02/11/2022] [Indexed: 11/01/2022] Open
Abstract
Gene prediction is a laborious and time-consuming task. The advancement of sequencing technologies and bioinformatics tools, coupled with accelerated rate of ribosome profiling and mass spectrometry development, have made identification of small open reading frames (sORFs) (< 100 codons) in various plant genomes possible. The past 50 years have seen sORFs being isolated from many organisms. However, to date, a comprehensive sORF annotation pipeline is as yet unavailable, hence, addressed in our review. Here, we also provide current information on classification and functions of plant sORFs and their potential applications in crop improvement programs.
Collapse
|
26
|
The dark proteome: translation from noncanonical open reading frames. Trends Cell Biol 2022; 32:243-258. [PMID: 34844857 PMCID: PMC8934435 DOI: 10.1016/j.tcb.2021.10.010] [Citation(s) in RCA: 58] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Revised: 10/26/2021] [Accepted: 10/29/2021] [Indexed: 02/07/2023]
Abstract
Omics-based technologies have revolutionized our understanding of the coding potential of the genome. In particular, these studies revealed widespread unannotated open reading frames (ORFs) throughout genomes and that these regions have the potential to encode novel functional (micro-)proteins and/or hold regulatory roles. However, despite their genomic prevalence, relatively few of these noncanonical ORFs have been functionally characterized, likely in part due to their under-recognition by the broader scientific community. The few that have been investigated in detail have demonstrated their essentiality in critical and divergent biological processes. As such, here we aim to discuss recent advances in understanding the diversity of noncanonical ORFs and their roles, as well as detail biologically important examples within the context of the mammalian genome.
Collapse
|
27
|
Mamaeva A, Knyazev A, Glushkevich A, Fesenko I. Quantitative proteomic dataset of the moss Physcomitrium patens PSEP3 KO and OE mutant lines. Data Brief 2022; 40:107715. [PMID: 34977300 PMCID: PMC8688553 DOI: 10.1016/j.dib.2021.107715] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 12/08/2021] [Accepted: 12/13/2021] [Indexed: 11/26/2022] Open
Abstract
Small open reading frames (<100 codons) that are located on long noncoding RNAs (lncRNAs) can encode functional microproteins. These microproteins are shown to play important roles in different cellular processes, such as cell proliferation, development and disease response [1], [2], [3], [4], [5], [6]. However, there are only a few known lncRNA-encoded functional microproteins in plants. One such microprotein that was named PSEP3, was identified in the moss Physcomitrium patens by mass-spectrometry analysis. 57-aa PSEP3 contains Low Complexity Region (LCR) enriched with proline. We have previously shown that PSEP3 is translated in protonemata and gametophores of P. patens, and its knockout (KO line) or overexpression (OE line) affects protonemata growth [7]. We performed a quantitative proteomic analysis of the mutant lines with PSEP3 knockout and overexpression. 7-days old protonemata of wild type (WT line) and both mutant lines (KO and OE) were collected and used for iTRAQ-based proteomic experiments. LC-MS/MS data were processed using PEAKS Studio v.8 software with protein identification based on a Phytozome protein database. More analysis of PSEP3 effects on plant growth can be obtained in the paper published in Nucleic Acid Research [8].
Collapse
|
28
|
Lyapina I, Ivanov V, Fesenko I. Peptidome: Chaos or Inevitability. Int J Mol Sci 2021; 22:13128. [PMID: 34884929 PMCID: PMC8658490 DOI: 10.3390/ijms222313128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/01/2021] [Accepted: 12/02/2021] [Indexed: 12/13/2022] Open
Abstract
Thousands of naturally occurring peptides differing in their origin, abundance and possible functions have been identified in the tissue and biological fluids of vertebrates, insects, fungi, plants and bacteria. These peptide pools are referred to as intracellular or extracellular peptidomes, and besides a small proportion of well-characterized peptide hormones and defense peptides, are poorly characterized. However, a growing body of evidence suggests that unknown bioactive peptides are hidden in the peptidomes of different organisms. In this review, we present a comprehensive overview of the mechanisms of generation and properties of peptidomes across different organisms. Based on their origin, we propose three large peptide groups-functional protein "degradome", small open reading frame (smORF)-encoded peptides (smORFome) and specific precursor-derived peptides. The composition of peptide pools identified by mass-spectrometry analysis in human cells, plants, yeast and bacteria is compared and discussed. The functions of different peptide groups, for example the role of the "degradome" in promoting defense signaling, are also considered.
Collapse
Affiliation(s)
| | | | - Igor Fesenko
- Department of Functional Genomics and Proteomics of Plants, Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry Russian Academy of Sciences, 117997 Moscow, Russia; (I.L.); (V.I.)
| |
Collapse
|
29
|
Chen L, Yang Y, Zhang Y, Li K, Cai H, Wang H, Zhao Q. The Small Open Reading Frame-Encoded Peptides: Advances in Methodologies and Functional Studies. Chembiochem 2021; 23:e202100534. [PMID: 34862721 DOI: 10.1002/cbic.202100534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 11/15/2021] [Indexed: 11/07/2022]
Abstract
Small open reading frames (sORFs) are an important class of genes with less than 100 codons. They were historically annotated as noncoding or even junk sequences. In recent years, accumulating evidence suggests that sORFs could encode a considerable number of polypeptides, many of which play important roles in both physiology and disease pathology. However, it has been technically challenging to directly detect sORF-encoded peptides (SEPs). Here, we discuss the latest advances in methodologies for identifying SEPs with mass spectrometry, as well as the progress on functional studies of SEPs.
Collapse
Affiliation(s)
- Lei Chen
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China.,Laboratory for Synthetic Chemistry and Chemical Biology Limited, Hong Kong Science and Technology Park, New Territories, Hong Kong SAR, 999077, P. R. China
| | - Ying Yang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Yuanliang Zhang
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Kecheng Li
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| | - Hongmin Cai
- School of Computer Science and Engineering, South China University of Technology, Guangzhou, 510623, P. R. China
| | - Hongwei Wang
- State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, 510623, P. R. China
| | - Qian Zhao
- State Key Laboratory of Chemical Biology and Drug Discovery, Department of Applied Biology and Chemical Technology, Hong Kong Polytechnic University, Hung Hom, Hong Kong SAR, 999077, P. R. China
| |
Collapse
|
30
|
Silveira GO, Coelho HS, Amaral MS, Verjovski-Almeida S. Long non-coding RNAs as possible therapeutic targets in protozoa, and in Schistosoma and other helminths. Parasitol Res 2021; 121:1091-1115. [PMID: 34859292 DOI: 10.1007/s00436-021-07384-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 11/14/2021] [Indexed: 12/26/2022]
Abstract
Long non-coding RNAs (lncRNAs) emerged in the past 20 years due to massive amounts of scientific data regarding transcriptomic analyses. They have been implicated in a plethora of cellular processes in higher eukaryotes. However, little is known about lncRNA possible involvement in parasitic diseases, with most studies only detecting their presence in parasites of human medical importance. Here, we review the progress on lncRNA studies and their functions in protozoans and helminths. In addition, we show an example of knockdown of one lncRNA in Schistosoma mansoni, SmLINC156349, which led to in vitro parasite adhesion, motility, and pairing impairment, with a 20% decrease in parasite viability and 33% reduction in female oviposition. Other observed phenotypes were a decrease in the proliferation rate of both male and female worms and their gonads, and reduced female lipid and vitelline droplets that are markers for well-developed vitellaria. Impairment of female worms' vitellaria in SmLINC156349-silenced worms led to egg development deficiency. All those results demonstrate the great potential of the tools and methods to characterize lncRNAs as potential new therapeutic targets. Further, we discuss the challenges and limitations of current methods for studying lncRNAs in parasites and possible solutions to overcome them, and we highlight the future directions of this exciting field.
Collapse
Affiliation(s)
- Gilbert O Silveira
- Laboratório de Parasitologia, Instituto Butantan, São Paulo, SP, 05503-900, Brazil.,Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, 05508-900, Brazil
| | - Helena S Coelho
- Laboratório de Parasitologia, Instituto Butantan, São Paulo, SP, 05503-900, Brazil
| | - Murilo S Amaral
- Laboratório de Parasitologia, Instituto Butantan, São Paulo, SP, 05503-900, Brazil.
| | - Sergio Verjovski-Almeida
- Laboratório de Parasitologia, Instituto Butantan, São Paulo, SP, 05503-900, Brazil. .,Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, 05508-900, Brazil.
| |
Collapse
|
31
|
Fesenko I, Shabalina SA, Mamaeva A, Knyazev A, Glushkevich A, Lyapina I, Ziganshin R, Kovalchuk S, Kharlampieva D, Lazarev V, Taliansky M, Koonin EV. A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants. Nucleic Acids Res 2021; 49:10328-10346. [PMID: 34570232 DOI: 10.1093/nar/gkab816] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 08/17/2021] [Accepted: 09/17/2021] [Indexed: 12/17/2022] Open
Abstract
Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in evolution and appear to be non-functional. However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousands of small open reading frames (smORFs, <100 codons) located on lncRNAs potentially might be translated into peptides or microproteins. We report a comprehensive analysis of the conservation and evolutionary trajectories of lncRNAs-smORFs from the moss Physcomitrium patens across transcriptomes of 479 plant species. Although thousands of smORFs are subject to substantial purifying selection, the majority of the smORFs appear to be evolutionary young and could represent a major pool for functional innovation. Using nanopore RNA sequencing, we show that, on average, the transcriptional level of conserved smORFs is higher than that of non-conserved smORFs. Proteomic analysis confirmed translation of 82 novel species-specific smORFs. Numerous conserved smORFs containing low complexity regions (LCRs) or transmembrane domains were identified, the biological functions of a selected LCR-smORF were demonstrated experimentally. Thus, microproteins encoded by smORFs are a major, functionally diverse component of the plant proteome.
Collapse
Affiliation(s)
- Igor Fesenko
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Anna Mamaeva
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Andrey Knyazev
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Anna Glushkevich
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Irina Lyapina
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Rustam Ziganshin
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Sergey Kovalchuk
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation
| | - Daria Kharlampieva
- Department of Cell Biology, Federal Research and Clinical Center of Physical -Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russian Federation
| | - Vassili Lazarev
- Department of Cell Biology, Federal Research and Clinical Center of Physical -Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russian Federation.,Moscow Institute of Physics and Technology (National Research University), Dolgoprudny, Moscow region, 141701, Russian Federation
| | - Michael Taliansky
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, Moscow 117997, Russian Federation.,The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
32
|
Guerra-Almeida D, Tschoeke DA, da-Fonseca RN. Understanding small ORF diversity through a comprehensive transcription feature classification. DNA Res 2021; 28:6317669. [PMID: 34240112 PMCID: PMC8435553 DOI: 10.1093/dnares/dsab007] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Indexed: 11/13/2022] Open
Abstract
Small open reading frames (small ORFs/sORFs/smORFs) are potentially coding sequences smaller than 100 codons that have historically been considered junk DNA by gene prediction software and in annotation screening; however, the advent of next-generation sequencing has contributed to the deeper investigation of junk DNA regions and their transcription products, resulting in the emergence of smORFs as a new focus of interest in systems biology. Several smORF peptides were recently reported in noncanonical mRNAs as new players in numerous biological contexts; however, their relevance is still overlooked in coding potential analysis. Hence, this review proposes a smORF classification based on transcriptional features, discussing the most promising approaches to investigate smORFs based on their different characteristics. First, smORFs were divided into nonexpressed (intergenic) and expressed (genic) smORFs. Second, genic smORFs were classified as smORFs located in noncoding RNAs (ncRNAs) or canonical mRNAs. Finally, smORFs in ncRNAs were further subdivided into sequences located in small or long RNAs, whereas smORFs located in canonical mRNAs were subdivided into several specific classes depending on their localization along the gene. We hope that this review provides new insights into large-scale annotations and reinforces the role of smORFs as essential components of a hidden coding DNA world.
Collapse
Affiliation(s)
- Diego Guerra-Almeida
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Diogo Antonio Tschoeke
- Alberto Luiz Coimbra Institute of Graduate Studies and Engineering Research (COPPE), Biomedical Engineering Program, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Rodrigo Nunes- da-Fonseca
- Institute of Biodiversity and Sustainability, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil.,National Institute of Science and Technology in Molecular Entomology, Rio de Janeiro, Brazil
| |
Collapse
|
33
|
Lyapina I, Filippova A, Kovalchuk S, Ziganshin R, Mamaeva A, Lazarev V, Latsis I, Mikhalchik E, Panasenko O, Ivanov O, Ivanov V, Fesenko I. Possible role of small secreted peptides (SSPs) in immune signaling in bryophytes. PLANT MOLECULAR BIOLOGY 2021; 106:123-143. [PMID: 33713297 DOI: 10.1007/s11103-021-01133-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 02/22/2021] [Indexed: 06/12/2023]
Abstract
Plants utilize a plethora of peptide signals to regulate their immune response. Peptide ligands and their cognate receptors involved in immune signaling share common motifs among many species of vascular plants. However, the origin and evolution of immune peptides is still poorly understood. Here, we searched for genes encoding small secreted peptides in the genomes of three bryophyte lineages-mosses, liverworts and hornworts-that occupy a critical position in the study of land plant evolution. We found that bryophytes shared common predicted small secreted peptides (SSPs) with vascular plants. The number of SSPs is higher in the genomes of mosses than in both the liverwort Marchantia polymorpha and the hornwort Anthoceros sp. The synthetic peptide elicitors-AtPEP and StPEP-specific for vascular plants, triggered ROS production in the protonema of the moss Physcomitrella patens, suggesting the possibility of recognizing peptide ligands from angiosperms by moss receptors. Mass spectrometry analysis of the moss Physcomitrella patens, both the wild type and the Δcerk mutant secretomes, revealed peptides that specifically responded to chitosan treatment, suggesting their role in immune signaling.
Collapse
Affiliation(s)
- Irina Lyapina
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Anna Filippova
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Sergey Kovalchuk
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Rustam Ziganshin
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Anna Mamaeva
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Vassili Lazarev
- Federal Research and Clinical Center of Physical-Chemical Medicine, Federal Medical Biological Agency, Moscow, Russia
| | - Ivan Latsis
- Federal Research and Clinical Center of Physical-Chemical Medicine, Federal Medical Biological Agency, Moscow, Russia
| | - Elena Mikhalchik
- Federal Research and Clinical Center of Physical-Chemical Medicine, Federal Medical Biological Agency, Moscow, Russia
| | - Oleg Panasenko
- Federal Research and Clinical Center of Physical-Chemical Medicine, Federal Medical Biological Agency, Moscow, Russia
| | - Oleg Ivanov
- V.F. Kuprevich Institute of Experimental Botany of the National Academy of Sciences of Belarus, Minsk, Republic of Belarus
| | - Vadim Ivanov
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Igor Fesenko
- Shemyakin and Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia.
| |
Collapse
|
34
|
Yang M, Shang X, Zhou Y, Wang C, Wei G, Tang J, Zhang M, Liu Y, Cao J, Zhang Q. Full-Length Transcriptome Analysis of Plasmodium falciparum by Single-Molecule Long-Read Sequencing. Front Cell Infect Microbiol 2021; 11:631545. [PMID: 33708645 PMCID: PMC7942025 DOI: 10.3389/fcimb.2021.631545] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 01/05/2021] [Indexed: 11/25/2022] Open
Abstract
Malaria, an infectious disease caused by Plasmodium parasites, still accounts for amounts of deaths annually in last decades. Despite the significance of Plasmodium falciparum as a model organism of malaria parasites, our understanding of gene expression of this parasite remains largely elusive since lots of progress on its genome and transcriptome are based on assembly with short sequencing reads. Herein, we report the new version of transcriptome dataset containing all full-length transcripts over the whole asexual blood stages by adopting a full-length sequencing approach with optimized experimental conditions of cDNA library preparation. We have identified a total of 393 alternative splicing (AS) events, 3,623 long non-coding RNAs (lncRNAs), 1,555 alternative polyadenylation (APA) events, 57 transcription factors (TF), 1,721 fusion transcripts in P. falciparum. Furthermore, the shotgun proteome was performed to validate the full-length transcriptome of P. falciparum. More importantly, integration of full-length transcriptomic and proteomic data identified 160 novel small proteins in lncRNA regions. Collectively, this full-length transcriptome dataset with high quality and accuracy and the shotgun proteome analyses shed light on the complex gene expression in malaria parasites and provide a valuable resource for related functional and mechanistic researches on P. falciparum genes.
Collapse
Affiliation(s)
- Mengquan Yang
- Research Center for Translational Medicine, Key Laboratory of Arrhythmias of the Ministry of Education of China, East Hospital, Tongji University School of Medicine, Shanghai, China.,State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,CAS Key Laboratory of Synthetic Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Xiaomin Shang
- Research Center for Translational Medicine, Key Laboratory of Arrhythmias of the Ministry of Education of China, East Hospital, Tongji University School of Medicine, Shanghai, China
| | - Yiqing Zhou
- CAS Key Laboratory of Synthetic Biology, CAS Center for Excellence in Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai, China
| | - Changhong Wang
- Research Center for Translational Medicine, Key Laboratory of Arrhythmias of the Ministry of Education of China, East Hospital, Tongji University School of Medicine, Shanghai, China
| | - Guiying Wei
- Research Center for Translational Medicine, Key Laboratory of Arrhythmias of the Ministry of Education of China, East Hospital, Tongji University School of Medicine, Shanghai, China
| | - Jianxia Tang
- National Health Commission Key Laboratory of Parasitic Disease Control and Prevention, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Institute of Parasitic Diseases, Wuxi, China
| | - Meihua Zhang
- National Health Commission Key Laboratory of Parasitic Disease Control and Prevention, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Institute of Parasitic Diseases, Wuxi, China
| | - Yaobao Liu
- National Health Commission Key Laboratory of Parasitic Disease Control and Prevention, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Institute of Parasitic Diseases, Wuxi, China
| | - Jun Cao
- National Health Commission Key Laboratory of Parasitic Disease Control and Prevention, Jiangsu Provincial Key Laboratory on Parasite and Vector Control Technology, Jiangsu Institute of Parasitic Diseases, Wuxi, China.,Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, China
| | - Qingfeng Zhang
- Research Center for Translational Medicine, Key Laboratory of Arrhythmias of the Ministry of Education of China, East Hospital, Tongji University School of Medicine, Shanghai, China
| |
Collapse
|
35
|
Schlesinger D, Elsässer SJ. Revisiting sORFs: overcoming challenges to identify and characterize functional microproteins. FEBS J 2021; 289:53-74. [PMID: 33595896 DOI: 10.1111/febs.15769] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 01/17/2021] [Accepted: 02/15/2021] [Indexed: 02/07/2023]
Abstract
Short ORFs (sORFs), that is, occurrences of a start and stop codon within 100 codons or less, can be found in organisms of all domains of life, outnumbering annotated protein-coding ORFs by orders of magnitude. Even though functional proteins smaller than 100 amino acids are known, the coding potential of sORFs has often been overlooked, as it is not trivial to predict and test for functionality within the large number of sORFs. Recent advances in ribosome profiling and mass spectrometry approaches, together with refined bioinformatic predictions, have enabled a huge leap forward in this field and identified thousands of likely coding sORFs. A relatively low number of small proteins or microproteins produced from these sORFs have been characterized so far on the molecular, structural, and/or mechanistic level. These however display versatile and, in some cases, essential cellular functions, allowing for the exciting possibility that many more, previously unknown small proteins might be encoded in the genome, waiting to be discovered. This review will give an overview of the steadily growing microprotein field, focusing on eukaryotic small proteins. We will discuss emerging themes in the molecular action of microproteins, as well as advances and challenges in microprotein identification and characterization.
Collapse
Affiliation(s)
- Dörte Schlesinger
- Science for Life Laboratory, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden.,Ming Wai Lau Centre for Reparative Medicine, Stockholm node, Karolinska Institutet, Stockholm, Sweden
| | - Simon J Elsässer
- Science for Life Laboratory, Division of Genome Biology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden.,Ming Wai Lau Centre for Reparative Medicine, Stockholm node, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
36
|
Brunet MA, Lucier JF, Levesque M, Leblanc S, Jacques JF, Al-Saedi HRH, Guilloy N, Grenier F, Avino M, Fournier I, Salzet M, Ouangraoua A, Scott M, Boisvert FM, Roucou X. OpenProt 2021: deeper functional annotation of the coding potential of eukaryotic genomes. Nucleic Acids Res 2021; 49:D380-D388. [PMID: 33179748 PMCID: PMC7779043 DOI: 10.1093/nar/gkaa1036] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 12/12/2022] Open
Abstract
OpenProt (www.openprot.org) is the first proteogenomic resource supporting a polycistronic annotation model for eukaryotic genomes. It provides a deeper annotation of open reading frames (ORFs) while mining experimental data for supporting evidence using cutting-edge algorithms. This update presents the major improvements since the initial release of OpenProt. All species support recent NCBI RefSeq and Ensembl annotations, with changes in annotations being reported in OpenProt. Using the 131 ribosome profiling datasets re-analysed by OpenProt to date, non-AUG initiation starts are reported alongside a confidence score of the initiating codon. From the 177 mass spectrometry datasets re-analysed by OpenProt to date, the unicity of the detected peptides is controlled at each implementation. Furthermore, to guide the users, detectability statistics and protein relationships (isoforms) are now reported for each protein. Finally, to foster access to deeper ORF annotation independently of one's bioinformatics skills or computational resources, OpenProt now offers a data analysis platform. Users can submit their dataset for analysis and receive the results from the analysis by OpenProt. All data on OpenProt are freely available and downloadable for each species, the release-based format ensuring a continuous access to the data. Thus, OpenProt enables a more comprehensive annotation of eukaryotic genomes and fosters functional proteomic discoveries.
Collapse
Affiliation(s)
- Marie A Brunet
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec City, QC G1V0A6, Canada
| | - Jean-François Lucier
- Center for Computational Science, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
- Biology Department, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
| | - Maxime Levesque
- Center for Computational Science, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
- Biology Department, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
| | - Sébastien Leblanc
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec City, QC G1V0A6, Canada
| | - Jean-Francois Jacques
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec City, QC G1V0A6, Canada
| | - Hassan R H Al-Saedi
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
| | - Noé Guilloy
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec City, QC G1V0A6, Canada
| | - Frederic Grenier
- Center for Computational Science, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
- Biology Department, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
| | - Mariano Avino
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
| | - Isabelle Fournier
- INSERM U1192, Laboratoire Protéomique, Réponse Inflammatoire & Spectrométrie de Masse (PRISM), Université de Lille, F-59000 Lille, France
| | - Michel Salzet
- INSERM U1192, Laboratoire Protéomique, Réponse Inflammatoire & Spectrométrie de Masse (PRISM), Université de Lille, F-59000 Lille, France
| | - Aïda Ouangraoua
- Informatics Department, Université de Sherbrooke, Sherbrooke, QC J1K 2R1, Canada
| | - Michelle S Scott
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
| | - François-Michel Boisvert
- Department of Immunology and Cellular Biology, Université de Sherbrooke, Sherbrooke, QC J1E 4K8, Canada
| | - Xavier Roucou
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, 3201 Jean Mignault, Sherbrooke, QC J1E 4K8, Canada
- PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering, Université Laval, Quebec City, QC G1V0A6, Canada
| |
Collapse
|
37
|
Fabre B, Combier JP, Plaza S. Recent advances in mass spectrometry-based peptidomics workflows to identify short-open-reading-frame-encoded peptides and explore their functions. Curr Opin Chem Biol 2021; 60:122-130. [PMID: 33401134 DOI: 10.1016/j.cbpa.2020.12.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 11/26/2020] [Accepted: 12/03/2020] [Indexed: 12/12/2022]
Abstract
Short open reading frame (sORF)-encoded polypeptides (SEPs) have recently emerged as key regulators of major cellular processes. Computational methods for the annotation of sORFs combined with transcriptomics and ribosome profiling approaches predicted the existence of tens of thousands of SEPs across the kingdom of life. Although, we still lack unambiguous evidence for most of them. The method of choice to validate the expression of SEPs is mass spectrometry (MS)-based peptidomics. Peptides are less abundant than proteins, which tends to hinder their detection. Therefore, optimization and enrichment methods are necessary to validate the existence of SEPs. In this article, we discuss the challenges for the detection of SEPs by MS and recent developments of biochemical approaches applied to the study of these peptides. We detail the advances made in the different key steps of a typical peptidomics workflow and highlight possible alternatives that have not been explored yet.
Collapse
Affiliation(s)
- Bertrand Fabre
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, CNRS, 31320, Auzeville-Tolosane, France.
| | - Jean-Philippe Combier
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, CNRS, 31320, Auzeville-Tolosane, France
| | - Serge Plaza
- Laboratoire de Recherche en Sciences Végétales, UMR5546, Université de Toulouse, UPS, CNRS, 31320, Auzeville-Tolosane, France
| |
Collapse
|
38
|
Taliansky M, Samarskaya V, Zavriev SK, Fesenko I, Kalinina NO, Love AJ. RNA-Based Technologies for Engineering Plant Virus Resistance. PLANTS 2021; 10:plants10010082. [PMID: 33401751 PMCID: PMC7824052 DOI: 10.3390/plants10010082] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Revised: 12/25/2020] [Accepted: 12/27/2020] [Indexed: 02/07/2023]
Abstract
In recent years, non-coding RNAs (ncRNAs) have gained unprecedented attention as new and crucial players in the regulation of numerous cellular processes and disease responses. In this review, we describe how diverse ncRNAs, including both small RNAs and long ncRNAs, may be used to engineer resistance against plant viruses. We discuss how double-stranded RNAs and small RNAs, such as artificial microRNAs and trans-acting small interfering RNAs, either produced in transgenic plants or delivered exogenously to non-transgenic plants, may constitute powerful RNA interference (RNAi)-based technology that can be exploited to control plant viruses. Additionally, we describe how RNA guided CRISPR-CAS gene-editing systems have been deployed to inhibit plant virus infections, and we provide a comparative analysis of RNAi approaches and CRISPR-Cas technology. The two main strategies for engineering virus resistance are also discussed, including direct targeting of viral DNA or RNA, or inactivation of plant host susceptibility genes. We also elaborate on the challenges that need to be overcome before such technologies can be broadly exploited for crop protection against viruses.
Collapse
Affiliation(s)
- Michael Taliansky
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (V.S.); (S.K.Z.); (I.F.); (N.O.K.)
- The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
- Correspondence: (M.T.); (A.J.L.)
| | - Viktoria Samarskaya
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (V.S.); (S.K.Z.); (I.F.); (N.O.K.)
| | - Sergey K. Zavriev
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (V.S.); (S.K.Z.); (I.F.); (N.O.K.)
| | - Igor Fesenko
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (V.S.); (S.K.Z.); (I.F.); (N.O.K.)
| | - Natalia O. Kalinina
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry of the Russian Academy of Sciences, 117997 Moscow, Russia; (V.S.); (S.K.Z.); (I.F.); (N.O.K.)
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskie Gory, 119991 Moscow, Russia
| | - Andrew J. Love
- The James Hutton Institute, Invergowrie, Dundee DD2 5DA, UK
- Correspondence: (M.T.); (A.J.L.)
| |
Collapse
|
39
|
Kirov I, Dudnikov M, Merkulov P, Shingaliev A, Omarov M, Kolganova E, Sigaeva A, Karlov G, Soloviev A. Nanopore RNA Sequencing Revealed Long Non-Coding and LTR Retrotransposon-Related RNAs Expressed at Early Stages of Triticale SEED Development. PLANTS 2020; 9:plants9121794. [PMID: 33348863 PMCID: PMC7765848 DOI: 10.3390/plants9121794] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Revised: 12/10/2020] [Accepted: 12/15/2020] [Indexed: 01/22/2023]
Abstract
The intergenic space of plant genomes encodes many functionally important yet unexplored RNAs. The genomic loci encoding these RNAs are often considered “junk”, DNA as they are frequently associated with repeat-rich regions of the genome. The latter makes the annotations of these loci and the assembly of the corresponding transcripts using short RNAseq reads particularly challenging. Here, using long-read Nanopore direct RNA sequencing, we aimed to identify these “junk” RNA molecules, including long non-coding RNAs (lncRNAs) and transposon-derived transcripts expressed during early stages (10 days post anthesis) of seed development of triticale (AABBRR, 2n = 6x = 42), an interspecific hybrid between wheat and rye. Altogether, we found 796 lncRNAs and 20 LTR retrotransposon-related transcripts (RTE-RNAs) expressed at this stage, with most of them being previously unannotated and located in the intergenic as well as intronic regions. Sequence analysis of the lncRNAs provide evidence for the frequent exonization of Class I (retrotransposons) and class II (DNA transposons) transposon sequences and suggest direct influence of “junk” DNA on the structure and origin of lncRNAs. We show that the expression patterns of lncRNAs and RTE-related transcripts have high stage specificity. In turn, almost half of the lncRNAs located in Genomes A and B have the highest expression levels at 10–30 days post anthesis in wheat. Detailed analysis of the protein-coding potential of the RTE-RNAs showed that 75% of them carry open reading frames (ORFs) for a diverse set of GAG proteins, the main component of virus-like particles of LTR retrotransposons. We further experimentally demonstrated that some RTE-RNAs originate from autonomous LTR retrotransposons with ongoing transposition activity during early stages of triticale seed development. Overall, our results provide a framework for further exploration of the newly discovered lncRNAs and RTE-RNAs in functional and genome-wide association studies in triticale and wheat. Our study also demonstrates that Nanopore direct RNA sequencing is an indispensable tool for the elucidation of lncRNA and retrotransposon transcripts.
Collapse
Affiliation(s)
- Ilya Kirov
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
- Kurchatov Genomics Center of ARRIAB, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Street, 42, 127550 Moscow, Russia
- Correspondence:
| | - Maxim Dudnikov
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
- Kurchatov Genomics Center of ARRIAB, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya Street, 42, 127550 Moscow, Russia
| | - Pavel Merkulov
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
| | - Andrey Shingaliev
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
| | - Murad Omarov
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
- Faculty of Computer Science, National Research University Higher School of Economics, Pokrovsky Boulvar, 11, 109028 Moscow, Russia
| | - Elizaveta Kolganova
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
| | - Alexandra Sigaeva
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
| | - Gennady Karlov
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
| | - Alexander Soloviev
- Laboratory of Marker-Assisted and Genomic Selection of Plants, All-Russia Research Institute of Agricultural Biotechnology, Timiryazevskaya str. 42, 127550 Moscow, Russia; (M.D.); (P.M.); (A.S.); (M.O.); (E.K.); (A.S.); (G.K.); (A.S.)
| |
Collapse
|
40
|
Wang S, Tian L, Liu H, Li X, Zhang J, Chen X, Jia X, Zheng X, Wu S, Chen Y, Yan J, Wu L. Large-Scale Discovery of Non-conventional Peptides in Maize and Arabidopsis through an Integrated Peptidogenomic Pipeline. MOLECULAR PLANT 2020; 13:1078-1093. [PMID: 32445888 DOI: 10.1016/j.molp.2020.05.012] [Citation(s) in RCA: 52] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2020] [Revised: 05/04/2020] [Accepted: 05/18/2020] [Indexed: 05/10/2023]
Abstract
Non-conventional peptides (NCPs), which include small open reading frame-encoded peptides, play critical roles in fundamental biological processes. In this study, we developed an integrated peptidogenomic pipeline using high-throughput mass spectra to probe a customized six-frame translation database and applied it to large-scale identification of NCPs in plants.A total of 1993 and 1860 NCPs were unambiguously identified in maize and Arabidopsis, respectively. These NCPs showed distinct characteristics compared with conventional peptides and were derived from introns, 3' UTRs, 5' UTRs, junctions, and intergenic regions. Furthermore, our results showed that translation events in unannotated transcripts occur more broadly than previously thought. In addition, we found that dozens of maize NCPs are enriched within regions associated with phenotypic variations and domestication selection, indicating that they potentially are involved in genetic regulation of complex traits and domestication in maize. Taken together, our study developed an integrated peptidogenomic pipeline for large-scale identification of NCPs in plants, which would facilitate global characterization of NCPs from other plants. The identification of large-scale NCPs in both monocot (maize) and dicot (Arabidopsis) plants indicates that a large portion of plant genome can be translated into biologically functional molecules, which has important implications for functional genomic studies.
Collapse
Affiliation(s)
- Shunxi Wang
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Lei Tian
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Haijun Liu
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Xiang Li
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China
| | - Jinghua Zhang
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Xueyan Chen
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Xingmeng Jia
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Xu Zheng
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Shubiao Wu
- School of Environmental and Rural Science, University of New England, Armidale, NSW 2351, Australia
| | - Yanhui Chen
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China
| | - Jianbing Yan
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
| | - Liuji Wu
- National Key Laboratory of Wheat and Maize Crop Science, Collaborative Innovation Center of Henan Grain Crops, College of Agronomy, Henan Agricultural University, Zhengzhou 450002, China.
| |
Collapse
|