1
|
Palos K, Yu L, Railey CE, Nelson Dittrich AC, Nelson ADL. Linking discoveries, mechanisms, and technologies to develop a clearer perspective on plant long noncoding RNAs. THE PLANT CELL 2023; 35:1762-1786. [PMID: 36738093 PMCID: PMC10226578 DOI: 10.1093/plcell/koad027] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Revised: 12/19/2022] [Accepted: 12/22/2022] [Indexed: 05/30/2023]
Abstract
Long noncoding RNAs (lncRNAs) are a large and diverse class of genes in eukaryotic genomes that contribute to a variety of regulatory processes. Functionally characterized lncRNAs play critical roles in plants, ranging from regulating flowering to controlling lateral root formation. However, findings from the past decade have revealed that thousands of lncRNAs are present in plant transcriptomes, and characterization has lagged far behind identification. In this setting, distinguishing function from noise is challenging. However, the plant community has been at the forefront of discovery in lncRNA biology, providing many functional and mechanistic insights that have increased our understanding of this gene class. In this review, we examine the key discoveries and insights made in plant lncRNA biology over the past two and a half decades. We describe how discoveries made in the pregenomics era have informed efforts to identify and functionally characterize lncRNAs in the subsequent decades. We provide an overview of the functional archetypes into which characterized plant lncRNAs fit and speculate on new avenues of research that may uncover yet more archetypes. Finally, this review discusses the challenges facing the field and some exciting new molecular and computational approaches that may help inform lncRNA comparative and functional analyses.
Collapse
Affiliation(s)
- Kyle Palos
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
| | - Li’ang Yu
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
| | - Caylyn E Railey
- Boyce Thompson Institute, Cornell University, Ithaca, NY 14853, USA
- Plant Biology Graduate Field, Cornell University, Ithaca, NY 14853, USA
| | | | | |
Collapse
|
2
|
Plewka P, Thompson A, Szymanski M, Nuc P, Knop K, Rasinska A, Bialkowska A, Szweykowska-Kulinska Z, Karlowski WM, Jarmolowski A. A stable tRNA-like molecule is generated from the long noncoding RNA GUT15 in Arabidopsis. RNA Biol 2018; 15:726-738. [PMID: 29561243 PMCID: PMC6152437 DOI: 10.1080/15476286.2018.1445404] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
The Arabidopsis GUT15 RNA belongs to a class of noncoding RNAs that are expressed from the intergenic regions of protein-coding genes. We show that the RNA polymerase II transcribed GUT15 transcript serves as a precursor for two stable RNA species, a tRNA-like molecule and GUT15-tRF-F5, which are both encoded by the final intron in the GUT15 gene. The GUT15-encoded tRNA-like molecule cannot be autonomously transcribed by RNA polymerase III. However, this molecule contains a CCA motif, suggesting that it may enter the tRNA maturation pathway. The GUT15-encoded tRNA-like sequence has an inhibiting effect on the splicing of its host intron. Moreover, we demonstrate that the canonical tRNA genes nested within introns do not affect the splicing patterns of their host protein-coding transcripts.
Collapse
Affiliation(s)
- Patrycja Plewka
- a Department of Gene Expression , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Agnieszka Thompson
- b Department of Computational Biology , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Maciej Szymanski
- b Department of Computational Biology , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Przemyslaw Nuc
- a Department of Gene Expression , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Katarzyna Knop
- a Department of Gene Expression , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Agnieszka Rasinska
- b Department of Computational Biology , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Aleksandra Bialkowska
- a Department of Gene Expression , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Zofia Szweykowska-Kulinska
- a Department of Gene Expression , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Wojciech M Karlowski
- b Department of Computational Biology , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| | - Artur Jarmolowski
- a Department of Gene Expression , Faculty of Biology, Institute of Molecular Biology and Biotechnology, Adam Mickiewicz University in Poznan , Poznan , Poland
| |
Collapse
|
3
|
Chakraborty S, Britton M, Wegrzyn J, Butterfield T, Martínez-García PJ, Reagan RL, Rao BJ, Leslie CA, Aradhaya M, Neale D, Woeste K, Dandekar AM. YeATS - a tool suite for analyzing RNA-seq derived transcriptome identifies a highly transcribed putative extensin in heartwood/sapwood transition zone in black walnut. F1000Res 2015; 4:155. [PMID: 26870317 PMCID: PMC4732554 DOI: 10.12688/f1000research.6617.2] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/30/2015] [Indexed: 11/20/2022] Open
Abstract
The transcriptome provides a functional footprint of the genome by enumerating the molecular components of cells and tissues. The field of transcript discovery has been revolutionized through high-throughput mRNA sequencing (RNA-seq). Here, we present a methodology that replicates and improves existing methodologies, and implements a workflow for error estimation and correction followed by genome annotation and transcript abundance estimation for RNA-seq derived transcriptome sequences (YeATS - Yet Another Tool Suite for analyzing RNA-seq derived transcriptome). A unique feature of YeATS is the upfront determination of the errors in the sequencing or transcript assembly process by analyzing open reading frames of transcripts. YeATS identifies transcripts that have not been merged, result in broken open reading frames or contain long repeats as erroneous transcripts. We present the YeATS workflow using a representative sample of the transcriptome from the tissue at the heartwood/sapwood transition zone in black walnut. A novel feature of the transcriptome that emerged from our analysis was the identification of a highly abundant transcript that had no known homologous genes (GenBank accession: KT023102). The amino acid composition of the longest open reading frame of this gene classifies this as a putative extensin. Also, we corroborated the transcriptional abundance of proline-rich proteins, dehydrins, senescence-associated proteins, and the DNAJ family of chaperone proteins. Thus, YeATS presents a workflow for analyzing RNA-seq data with several innovative features that differentiate it from existing software.
Collapse
Affiliation(s)
| | - Monica Britton
- UC Davis Genome Center Bioinformatics Core Facility, University of California, Davis, CA, 95616, USA
| | - Jill Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269, USA
| | | | | | - Russell L Reagan
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | - Basuthkar J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research, Homi Bhaba Road, Mumbai, 400, India
| | - Charles A Leslie
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | | | - David Neale
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | - Keith Woeste
- USDA Forest Service Hardwood Tree Improvement and Regeneration Center, Purdue University, West Lafayette, IN, 47907, USA
| | - Abhaya M Dandekar
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| |
Collapse
|
4
|
Chakraborty S, Britton M, Wegrzyn J, Butterfield T, Martínez-García PJ, Reagan RL, Rao BJ, Leslie CA, Aradhaya M, Neale D, Woeste K, Dandekar AM. YeATS - a tool suite for analyzing RNA-seq derived transcriptome identifies a highly transcribed putative extensin in heartwood/sapwood transition zone in black walnut. F1000Res 2015; 4:155. [PMID: 26870317 DOI: 10.12688/f1000research.6617.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/11/2015] [Indexed: 11/20/2022] Open
Abstract
The transcriptome provides a functional footprint of the genome by enumerating the molecular components of cells and tissues. The field of transcript discovery has been revolutionized through high-throughput mRNA sequencing (RNA-seq). Here, we present a methodology that replicates and improves existing methodologies, and implements a workflow for error estimation and correction followed by genome annotation and transcript abundance estimation for RNA-seq derived transcriptome sequences (YeATS - Yet Another Tool Suite for analyzing RNA-seq derived transcriptome). A unique feature of YeATS is the upfront determination of the errors in the sequencing or transcript assembly process by analyzing open reading frames of transcripts. YeATS identifies transcripts that have not been merged, result in broken open reading frames or contain long repeats as erroneous transcripts. We present the YeATS workflow using a representative sample of the transcriptome from the tissue at the heartwood/sapwood transition zone in black walnut. A novel feature of the transcriptome that emerged from our analysis was the identification of a highly abundant transcript that had no known homologous genes (GenBank accession: KT023102). The amino acid composition of the longest open reading frame of this gene classifies this as a putative extensin. Also, we corroborated the transcriptional abundance of proline-rich proteins, dehydrins, senescence-associated proteins, and the DNAJ family of chaperone proteins. Thus, YeATS presents a workflow for analyzing RNA-seq data with several innovative features that differentiate it from existing software.
Collapse
Affiliation(s)
| | - Monica Britton
- UC Davis Genome Center Bioinformatics Core Facility, University of California, Davis, CA, 95616, USA
| | - Jill Wegrzyn
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, 06269, USA
| | | | | | - Russell L Reagan
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | - Basuthkar J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research, Homi Bhaba Road, Mumbai, 400, India
| | - Charles A Leslie
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | | | - David Neale
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | - Keith Woeste
- USDA Forest Service Hardwood Tree Improvement and Regeneration Center, Purdue University, West Lafayette, IN, 47907, USA
| | - Abhaya M Dandekar
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| |
Collapse
|
5
|
Li D, Shao F, Lu S. Identification and characterization of mRNA-like noncoding RNAs in Salvia miltiorrhiza. PLANTA 2015; 241:1131-43. [PMID: 25601000 DOI: 10.1007/s00425-015-2246-z] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2014] [Accepted: 01/09/2015] [Indexed: 05/20/2023]
Abstract
Identification and characterization of 5,446 mlncRNAs from Salvia miltiorrhiza showed that the majority of identified mlncRNAs were stress responsive, providing a framework for elucidating mlncRNA functions in S. miltiorrhiza. mRNA-like noncoding RNAs (mlncRNAs) are transcribed by RNA polymerase II and are polyadenylated, capped and spliced. They play important roles in plant development and defense responses. However, there is no information available for mlncRNAs in Salvia miltiorrhiza Bunge, the first Chinese medicinal material entering the international market. To perform a transcriptome-wide identification of S. miltiorrhiza mlncRNAs, we assembled over 8 million RNA-seq reads from GenBank database and 5,624 ESTs from PlantGDB into 44422 unigenes. Using a computational identification pipeline, we identified 5446 S. miltiorrhiza mlncRNA candidates from the assembled unigenes. Of the 5446 mlncRNAs, 2 are primary transcripts of conserved miRNAs, and 2030 can be grouped into 470 families with at least two members in a family. Quantitative real-time PCR analysis of mlncRNAs with at least 900 nt showed that the majority were differentially expressed in roots, stems, leaves and flowers and responsive to methyl jasmonate (MeJA) treatment in S. miltiorrhiza. Analysis of published RNA-seq data showed that a total of 3,044 mlncRNAs were expressed in hairy roots of S. miltiorrhiza and the expression of 1,904 of the 3,044 mlncRNAs was altered by yeast extract and Ag(+) treatment. The results indicate that the majority of mlncRNAs are involved in plant response to stress. This study provides a framework for understanding the roles of mlncRNAs in S. miltiorrhiza.
Collapse
Affiliation(s)
- Dongqiao Li
- Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China,
| | | | | |
Collapse
|
6
|
Hao Z, Fan C, Cheng T, Su Y, Wei Q, Li G. Genome-wide identification, characterization and evolutionary analysis of long intergenic noncoding RNAs in cucumber. PLoS One 2015; 10:e0121800. [PMID: 25799544 PMCID: PMC4370693 DOI: 10.1371/journal.pone.0121800] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2014] [Accepted: 02/04/2015] [Indexed: 12/25/2022] Open
Abstract
Long intergenic noncoding RNAs (lincRNAs) are intergenic transcripts with a length of at least 200 nt that lack coding potential. Emerging evidence suggests that lincRNAs from animals participate in many fundamental biological processes. However, the systemic identification of lincRNAs has been undertaken in only a few plants. We chose to use cucumber (Cucumis sativus) as a model to analyze lincRNAs due to its importance as a model plant for studying sex differentiation and fruit development and the rich genomic and transcriptome data available. The application of a bioinformatics pipeline to multiple types of gene expression data resulted in the identification and characterization of 3,274 lincRNAs. Next, 10 lincRNAs targeted by 17 miRNAs were also explored. Based on co-expression analysis between lincRNAs and mRNAs, 94 lincRNAs were annotated, which may be involved in response to stimuli, multi-organism processes, reproduction, reproductive processes, and growth. Finally, examination of the evolution of lincRNAs showed that most lincRNAs are under purifying selection, while 16 lincRNAs are under natural selection. Our results provide a rich resource for further validation of cucumber lincRNAs and their function. The identification of lincRNAs targeted by miRNAs offers new clues for investigations into the role of lincRNAs in regulating gene expression. Finally, evaluation of the lincRNAs suggested that some lincRNAs are under positive and balancing selection.
Collapse
Affiliation(s)
- Zhiqiang Hao
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
- Co-Innovation Center for Qinba Regions’ Sustainable Development, College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Chunyan Fan
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Tian Cheng
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Ya Su
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Qiang Wei
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
| | - Guanglin Li
- College of Life Sciences, Shaanxi Normal University, Xi’an, China
- Co-Innovation Center for Qinba Regions’ Sustainable Development, College of Life Sciences, Shaanxi Normal University, Xi’an, China
- * E-mail:
| |
Collapse
|
7
|
Li J, Wu B, Xu J, Liu C. Genome-wide identification and characterization of long intergenic non-coding RNAs in Ganoderma lucidum. PLoS One 2014; 9:e99442. [PMID: 24932683 PMCID: PMC4059649 DOI: 10.1371/journal.pone.0099442] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Accepted: 05/14/2014] [Indexed: 12/26/2022] Open
Abstract
Ganoderma lucidum is a white-rot fungus best-known for its medicinal activities. We have previously sequenced its genome and annotated the protein coding genes. However, long non-coding RNAs in G. lucidum genome have not been analyzed. In this study, we have identified and characterized long intergenic non-coding RNAs (lincRNA) in G. lucidum systematically. We developed a computational pipeline, which was used to analyze RNA-Seq data derived from G. lucidum samples collected from three developmental stages. A total of 402 lincRNA candidates were identified, with an average length of 609 bp. Analysis of their adjacent protein-coding genes (apcGenes) revealed that 46 apcGenes belong to the pathways of triterpenoid biosynthesis and lignin degradation, or families of cytochrome P450, mating type B genes, and carbohydrate-active enzymes. To determine if lincRNAs and these apcGenes have any interactions, the corresponding pairs of lincRNAs and apcGenes were analyzed in detail. We developed a modified 3' RACE method to analyze the transcriptional direction of a transcript. Among the 46 lincRNAs, 37 were found unidirectionally transcribed, and 9 were found bidirectionally transcribed. The expression profiles of 16 of these 37 lincRNAs were found to be highly correlated with those of the apcGenes across the three developmental stages. Among them, 11 are positively correlated (r>0.8) and 5 are negatively correlated (r<-0.8). The co-localization and co-expression of lincRNAs and those apcGenes playing important functions is consistent with the notion that lincRNAs might be important regulators for cellular processes. In summary, this represents the very first study to identify and characterize lincRNAs in the genomes of basidiomycetes. The results obtained here have laid the foundation for study of potential lincRNA-mediated expression regulation of genes in G. lucidum.
Collapse
MESH Headings
- Chromosome Mapping
- Chromosomes, Fungal/genetics
- Computational Biology/methods
- Fungal Proteins/genetics
- Gene Expression Profiling
- Gene Expression Regulation, Fungal/genetics
- Genes, Fungal
- Genome, Fungal
- Lignin/metabolism
- Mycelium/physiology
- Polymerase Chain Reaction/methods
- RNA, Fungal/genetics
- RNA, Fungal/isolation & purification
- RNA, Long Noncoding/genetics
- RNA, Long Noncoding/isolation & purification
- Reishi/genetics
- Reishi/growth & development
- Reishi/metabolism
- Sequence Analysis, RNA
- Transcription, Genetic
- Triterpenes/metabolism
Collapse
Affiliation(s)
- Jianqin Li
- Center of Bioinformatics, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, P. R. China
| | - Bin Wu
- Center of Bioinformatics, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, P. R. China
| | - Jiang Xu
- Center of Bioinformatics, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, P. R. China
| | - Chang Liu
- Center of Bioinformatics, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, P. R. China
| |
Collapse
|
8
|
Song JH, Cao JS, Wang CG. BcMF11, a novel non-coding RNA gene from Brassica campestris, is required for pollen development and male fertility. PLANT CELL REPORTS 2013; 32:21-30. [PMID: 23064614 DOI: 10.1007/s00299-012-1337-6] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2012] [Revised: 08/09/2012] [Accepted: 08/17/2012] [Indexed: 05/23/2023]
Abstract
KEY MESSAGE : BcMF11 as a non-coding RNA gene has an essential role in pollen development, and might be useful for regulating the pollen fertility of crops by antisense RNA technology. We previously identified a 828-bp full-length cDNA of BcMF11, a novel pollen-specific non-coding mRNA-like gene from Chinese cabbage (Brassica campestris L. ssp. chinensis Makino). However, little information is known about the function of BcMF11 in pollen development. To investigate its exact biological roles in pollen development, the BcMF11 cDNA was antisense inhibited in transgenic Chinese cabbage under the control of a tapetum-specific promoter BcA9 and a constitutive promoter CaMV 35S. Antisense RNA transgenic plants displayed decreasing expression of BcMF11 and showed distinct morphological defects. Pollen germination test in vitro and in vivo of the transgenic plants suggested that inhibition of BcMF11 decreased pollen germination efficiency and delayed the pollen tubes' extension in the style. Under scanning electron microscopy, many shrunken and collapsed pollen grains were detected in the antisense BcMF11 transgenic Chinese cabbage. Further cytological observation revealed abnormal pollen development process in transgenic plants, including delayed degradation of tapetum, asynchronous separation of microspore, and aborted development of pollen grain. These results suggest that BcMF11, as a non-coding RNA, plays an essential role in pollen development and male fertility.
Collapse
Affiliation(s)
- Jiang-Hua Song
- College of Horticulture, Anhui Agricultural University, Hefei 230036, PR China
| | | | | |
Collapse
|
9
|
Richardson CR, Luo QJ, Gontcharova V, Jiang YW, Samanta M, Youn E, Rock CD. Analysis of antisense expression by whole genome tiling microarrays and siRNAs suggests mis-annotation of Arabidopsis orphan protein-coding genes. PLoS One 2010; 5:e10710. [PMID: 20520764 PMCID: PMC2877095 DOI: 10.1371/journal.pone.0010710] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2009] [Accepted: 04/26/2010] [Indexed: 11/22/2022] Open
Abstract
Background MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20–22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers which can be computationally modeled for gene discovery. Principal Findings We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests several hundred Arabidopsis ‘orphan’ hypothetical genes are non-coding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM when trained on targets could predict the “ancient” (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and 76% for “new” rapidly-evolving MIRNA genes. Conclusions Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation.
Collapse
Affiliation(s)
- Casey R. Richardson
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| | - Qing-Jun Luo
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| | - Viktoria Gontcharova
- Department of Computer Science, Texas Tech University, Lubbock, Texas, United States of America
| | - Ying-Wen Jiang
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
| | - Manoj Samanta
- Systemix Institute, Redmond, Washington, United States of America
| | - Eunseog Youn
- Department of Computer Science, Texas Tech University, Lubbock, Texas, United States of America
| | - Christopher D. Rock
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
- * E-mail:
| |
Collapse
|
10
|
Rymarquis LA, Kastenmayer JP, Hüttenhofer AG, Green PJ. Diamonds in the rough: mRNA-like non-coding RNAs. TRENDS IN PLANT SCIENCE 2008; 13:329-34. [PMID: 18448381 DOI: 10.1016/j.tplants.2008.02.009] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2008] [Revised: 02/19/2008] [Accepted: 02/21/2008] [Indexed: 05/20/2023]
Abstract
Non-coding RNAs are increasingly being identified as crucial regulators of gene expression and other cellular functions in plants. Experimental and computational methods have revealed the existence of mRNA-like non-coding RNAs (mlncRNAs), a class of non-coding RNAs that, in plants, are associated with tissue-specific expression, development and the phosphate-starvation response. Although their mechanisms of action are largely unknown, one can speculate that mlncRNAs act through secondary structures or specific sequences that bind to proteins or metabolites, or that have catalytic activity. This review summarizes the computational methods developed to identify candidate mlncRNAs, and the current experimental evidence regarding the function of several known mlncRNAs.
Collapse
Affiliation(s)
- Linda A Rymarquis
- Delaware Biotechnology Institute, University of Delaware, 15 Innovation Way, Newark, DE 19711, USA
| | | | | | | |
Collapse
|
11
|
Song JH, Cao JS, Yu XL, Xiang X. BcMF11, a putative pollen-specific non-coding RNA from Brassica campestris ssp. chinensis. JOURNAL OF PLANT PHYSIOLOGY 2007; 164:1097-100. [PMID: 17207554 DOI: 10.1016/j.jplph.2006.10.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2006] [Revised: 10/07/2006] [Accepted: 10/09/2006] [Indexed: 05/13/2023]
Abstract
A full-length cDNA, BcMF11, has been cloned from Chinese cabbage (Brassica campestris L. ssp. chinensis Makino) using rapid amplification of the cDNA ends (RACE) based on a pollen-specific cDNA fragment (DN237921). The BcMF11 cDNA has a total length of 828bp with poly (A) tail. Analysis of the sequence demonstrated that BcMF11 is a novel non-coding RNA which has no prominent open reading frame (ORF) or coding capacity. No significant similarities were observed between BcMF11 and previously published sequences in GenBank. Transcription analysis indicated that BcMF11 is a novel pollen-specific ncRNA and may be involved in the pollen development of Chinese cabbage.
Collapse
Affiliation(s)
- Jiang-Hua Song
- Laboratory of Cell and Molecular Biology, Institute of Vegetable Science, Zhejiang University, Hangzhou 310029, PR China
| | | | | | | |
Collapse
|
12
|
Fu X, Ravindranath L, Tran N, Petrovics G, Srivastava S. Regulation of apoptosis by a prostate-specific and prostate cancer-associated noncoding gene, PCGEM1. DNA Cell Biol 2006; 25:135-41. [PMID: 16569192 DOI: 10.1089/dna.2006.25.135] [Citation(s) in RCA: 173] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
PCGEM1 is a prostate tissue-specific, and prostate cancer-associated noncoding RNA (ncRNA) gene. Previous results revealed a significant association of elevated PCGEM1 expression levels in prostate cancer cells of African-American patients, whose mortality rate is the highest among prostate cancer patients. Functional study of PCGEM1 demonstrated a marked increase in colony formation in LNCaP prostate cancer cells and NIH3T3 mouse fibroblast cells. This study demonstrates that PCGEM1 overexpression in LNCaP cell culture model results in the inhibition of apoptosis induced by doxorubicin (DOX). Induction of p53 and p21(Waf1/Cip1) by DOX were delayed in LNCaP cells stably overexpressing PCGEM1 (LNCaP-PCGEM1 cells) compared to control LNCaP cells. The protein levels of cleaved caspase 7, and cleaved PARP were attenuated in DOXtreated LNCaP-PCGEM1 cells compared to control LNCaP cells. Similar results were observed in LNCaP cells transiently overexpressing PCGEM1. The inhibition of PARP cleavage by PCGEM1 overexpression was also observed in LNCaP-PCGEM1 cells incubated with etoposide and sodium selenite. Fluorescence-Activated Cell Sorter Annexin-V analysis revealed significantly lower percentage of apoptotic cells in DOX-treated LNCaP-PCGEM1 cells compared to control LNCaP cells. The attenuation of apoptic response appears to be androgen dependent in this experimental model, as androgen-independent variants of LNCaP cells did not exhibit this response. In summary, this study provides new insights into cell biologic functions and novel features of an ncRNA. Further, these data unravel biological mechanisms of cell growth/cell survival-associated functions of this ncRNA in a widely used prostate cancer cell culture model.
Collapse
Affiliation(s)
- Xiaoqin Fu
- Department of Surgery, Center for Prostate Disease Research (CPDR), U.S. Military Cancer Institute, Uniformed Services University of the Health Sciences, Bethesda, MD 20852, USA
| | | | | | | | | |
Collapse
|
13
|
Fukai E, Fujimoto R, Nishio T. Genomic organization of the S core region and the S flanking regions of a class-II S haplotype in Brassica rapa. Mol Genet Genomics 2003; 269:361-9. [PMID: 12684882 DOI: 10.1007/s00438-003-0844-0] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2002] [Accepted: 03/19/2003] [Indexed: 10/26/2022]
Abstract
The nucleotide sequence of an 86.4-kb region that includes the SP11, SRK, and SLG genes of Brassica rapa S-60 (a class-II S haplotype) was determined. In the sequenced region, 13 putative genes were found besides SP11-60, SRK-60, and SLG-60. Five of these sequences were isolated as cDNAs, five were homologues of known genes, cDNAs, or ORFs, and three are hypothetical ORFs. Based on their nucleotide sequences, however, some of them are thought to be non-functional. Two regions of colinearity between the class-II S-60 and Brassica class-I S haplotypes were identified, i.e., S flanking region 1 which shows partial colinearity of non-genic sequences and S flanking region 2 which shows a high level of colinearity. The observed colinearity made it possible to compare the order of SP-11, SRK, and SLG genes in the S locus between the five sequenced S haplotypes. It emerged that the order of SRK and SLG in class-II S-60 is the reverse of that in the four class-I S haplotypes reported so far, and the order of SP11, SRK and SLG is the opposite of that in the class-I haplotype S-910. The possible gene designated as SAN1 (S locus Anther-expressed Non-coding RNA like-1), which is located in the region between SP11-60 and SRK-60, has features reminiscent of genes for non-coding RNAs (ncRNAs), but no homologous sequences were found in the databases. This sequence is transcribed in anthers but not in stigmas or leaves. These features of the genomic structure of S-60 are discussed with special reference to the characteristics of class-II S haplotypes.
Collapse
Affiliation(s)
- E Fukai
- Graduate School of Agricultural Science, Tohoku University, Aoba-ku, 981-8555, Sendai, Japan
| | | | | |
Collapse
|
14
|
Abstract
Non-coding ribonucleic acids (RNAs) do not contain a peptide-encoding open reading frame and are therefore not translated into proteins. They are expressed in all phyla, and in eukaryotic cells they are found in the nucleus, cytoplasm, and mitochondria. Non-coding RNAs either can exert structural functions, as do transfer and ribosomal RNAs, or they can regulate gene expression. Non-coding RNAs with regulatory functions differ in size ranging from a few nucleotides to over 100 kb and have diverse cell- or development-specific functions. Some of the non-coding RNAs associate with human diseases. This chapter summarizes the current knowledge about regulatory non-coding RNAs.
Collapse
Affiliation(s)
- Uwe Michel
- Department of Neurology, Laboratory of Neurobiology, Göttingen, Germany
| |
Collapse
|
15
|
Abstract
A variety of RNA molecules have been found over the last 20 years to have a remarkable range of functions beyond the well-known roles of messenger, ribosomal and transfer RNAs. Here, we present a general categorization of all non-coding RNAs and briefly discuss the ones that affect transcription, translation and protein function.
Collapse
Affiliation(s)
- Maciej Szymański
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland
| | - Jan Barciszewski
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland
| |
Collapse
|
16
|
MacIntosh GC, Wilkerson C, Green PJ. Identification and analysis of Arabidopsis expressed sequence tags characteristic of non-coding RNAs. PLANT PHYSIOLOGY 2001; 127:765-776. [PMID: 11706161 DOI: 10.1104/pp.010501] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
Sequencing of the Arabidopsis genome has led to the identification of thousands of new putative genes based on the predicted proteins they encode. Genes encoding tRNAs, ribosomal RNAs, and small nucleolar RNAs have also been annotated; however, a potentially important class of genes has largely escaped previous annotation efforts. These genes correspond to RNAs that lack significant open reading frames and encode RNA as their final product. Accumulating evidence indicates that such "non-coding RNAs" (ncRNAs) can play critical roles in a wide range of cellular processes, including chromosomal silencing, transcriptional regulation, developmental control, and responses to stress. Approximately 15 putative Arabidopsis ncRNAs have been reported in the literature or have been annotated. Although several have homologs in other plant species, all appear to be plant specific, with the exception of signal recognition particle RNA. Conversely, none of the ncRNAs reported from yeast or animal systems have homologs in Arabidopsis or other plants. To identify additional genes that are likely to encode ncRNAs, we used computational tools to filter protein-coding genes from genes corresponding to 20,000 expressed sequence tag clones. Using this strategy, we identified 19 clones with characteristics of ncRNAs, nine putative peptide-coding RNAs with open reading frames smaller than 100 amino acids, and 11 that could not be differentiated between the two categories. Again, none of these clones had homologs outside the plant kingdom, suggesting that most Arabidopsis ncRNAs are likely plant specific. These data indicate that ncRNAs represent a significant and underdeveloped aspect of Arabidopsis genomics that deserves further study.
Collapse
MESH Headings
- Algorithms
- Amino Acid Sequence
- Arabidopsis/genetics
- Arabidopsis/physiology
- Cloning, Molecular
- Cytokinins/physiology
- Databases, Nucleic Acid
- Expressed Sequence Tags
- Gene Expression Regulation, Plant
- Gene Silencing
- Internet
- Molecular Sequence Data
- Open Reading Frames
- Phosphates/physiology
- RNA, Plant/analysis
- RNA, Plant/genetics
- RNA, Plant/physiology
- RNA, Untranslated/genetics
- RNA, Untranslated/physiology
- Saccharomyces cerevisiae/genetics
- Saccharomyces cerevisiae/physiology
- Sequence Analysis, Protein
- Species Specificity
- Transcription, Genetic
Collapse
Affiliation(s)
- G C MacIntosh
- Department of Energy Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, USA
| | | | | |
Collapse
|
17
|
Mattick JS. Non-coding RNAs: the architects of eukaryotic complexity. EMBO Rep 2001; 2:986-91. [PMID: 11713189 PMCID: PMC1084129 DOI: 10.1093/embo-reports/kve230] [Citation(s) in RCA: 551] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2001] [Revised: 09/10/2001] [Accepted: 09/11/2001] [Indexed: 11/14/2022] Open
Abstract
Around 98% of all transcriptional output in humans is non-coding RNA. RNA-mediated gene regulation is widespread in higher eukaryotes and complex genetic phenomena like RNA interference, co-suppression, transgene silencing, imprinting, methylation, and possibly position-effect variegation and transvection, all involve intersecting pathways based on or connected to RNA signaling. I suggest that the central dogma is incomplete, and that intronic and other non-coding RNAs have evolved to comprise a second tier of gene expression in eukaryotes, which enables the integration and networking of complex suites of gene activity. Although proteins are the fundamental effectors of cellular function, the basis of eukaryotic complexity and phenotypic variation may lie primarily in a control architecture composed of a highly parallel system of trans-acting RNAs that relay state information required for the coordination and modulation of gene expression, via chromatin remodeling, RNA-DNA, RNA-RNA and RNA-protein interactions. This system has interesting and perhaps informative analogies with small world networks and dataflow computing.
Collapse
Affiliation(s)
- J S Mattick
- ARC Special Research Centre for Functional and Applied Genomics, Institute for Molecular Bioscience, University of Queensland, Brisbane 4072, Australia.
| |
Collapse
|
18
|
Sousa C, Johansson C, Charon C, Manyani H, Sautter C, Kondorosi A, Crespi M. Translational and structural requirements of the early nodulin gene enod40, a short-open reading frame-containing RNA, for elicitation of a cell-specific growth response in the alfalfa root cortex. Mol Cell Biol 2001; 21:354-66. [PMID: 11113209 PMCID: PMC88808 DOI: 10.1128/mcb.21.1.354-366.2001] [Citation(s) in RCA: 85] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A diversity of mRNAs containing only short open reading frames (sORF-RNAs; encoding less than 30 amino acids) have been shown to be induced in growth and differentiation processes. The early nodulin gene enod40, coding for a 0.7-kb sORF-RNA, is expressed in the nodule primordium developing in the root cortex of leguminous plants after infection by symbiotic bacteria. Ballistic microtargeting of this gene into Medicago roots induced division of cortical cells. Translation of two sORFs (I and II, 13 and 27 amino acids, respectively) present in the conserved 5' and 3' regions of enod40 was required for this biological activity. These sORFs may be translated in roots via a reinitiation mechanism. In vitro translation products starting from the ATG of sORF I were detectable by mutating enod40 to yield peptides larger than 38 amino acids. Deletion of a Medicago truncatula enod40 region between the sORFs, spanning a predicted RNA structure, did not affect their translation but resulted in significantly decreased biological activity. Our data reveal a complex regulation of enod40 action, pointing to a role of sORF-encoded peptides and structured RNA signals in developmental processes involving sORF-RNAs.
Collapse
MESH Headings
- Base Sequence
- Biolistics
- Cell Division
- Gene Expression Regulation, Developmental
- Gene Expression Regulation, Plant
- Gene Targeting
- Genes, Reporter/genetics
- Immunohistochemistry
- Medicago sativa/genetics
- Medicago sativa/growth & development
- Medicago sativa/metabolism
- Molecular Sequence Data
- Mutation/genetics
- Nucleic Acid Conformation
- Open Reading Frames/genetics
- Peptides/chemistry
- Peptides/genetics
- Peptides/metabolism
- Plant Growth Regulators/biosynthesis
- Plant Growth Regulators/chemistry
- Plant Growth Regulators/genetics
- Plant Growth Regulators/metabolism
- Plant Proteins/biosynthesis
- Plant Proteins/chemistry
- Plant Proteins/genetics
- Plant Proteins/metabolism
- Plant Roots/genetics
- Plant Roots/growth & development
- Plant Roots/metabolism
- Protein Biosynthesis/genetics
- RNA, Long Noncoding
- RNA, Plant/chemistry
- RNA, Plant/genetics
- RNA, Plant/metabolism
- RNA, Untranslated/chemistry
- RNA, Untranslated/genetics
- RNA, Untranslated/metabolism
- Recombinant Fusion Proteins/biosynthesis
Collapse
Affiliation(s)
- C Sousa
- Institut des Sciences Végétales, Centre National de la Recherche Scientifique, F-91198 Gif-sur-Yvette Cedex, France
| | | | | | | | | | | | | |
Collapse
|
19
|
Erdmann VA, Szymanski M, Hochberg A, de Groot N, Barciszewski J. Collection of mRNA-like non-coding RNAs. Nucleic Acids Res 1999; 27:192-5. [PMID: 9847177 PMCID: PMC148132 DOI: 10.1093/nar/27.1.192] [Citation(s) in RCA: 62] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
In last few years much data has accumulated which shows that in different cells various RNA transcripts are synthesized. They lack protein coding capacity and do not produce mature protein. It seems that they work mainly or exclusively on the RNA level. Their function and mechanism of action is poorly understood. In this paper we have collected all known RNA transcript and prepared a database for further structural and functional studies. This is the first collection of the nucleotide sequences of RNAs of this kind. The data can be accessed via WWW at: http://www.man.poznan.pl/5SData/ncRNA/inde x.html
Collapse
Affiliation(s)
- V A Erdmann
- Institut fur Biochemie, Freie Universitat Berlin, Thielallee 63, 14195 Berlin, Germany
| | | | | | | | | |
Collapse
|
20
|
Franssen HJ. Plants embrace a stepchild: the discovery of peptide growth regulators. CURRENT OPINION IN PLANT BIOLOGY 1998; 1:384-387. [PMID: 10066615 DOI: 10.1016/s1369-5266(98)80260-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Over the past decade, peptides have been added to the collection of signalling molecules in plants. As the impact of peptide hormones in non-plants is enormous, a comparison of plant and non-plant peptide signal molecules at this stage deserves our attention-not only to reveal common and unique features, but also to point to new avenues of future research on plant hormones.
Collapse
Affiliation(s)
- H J Franssen
- Department of Molecular Biology, Agricultural University, Dreijenlaan 3, 6703HA Wageningen, The Netherlands.
| |
Collapse
|