1
|
Lee U, Mozeika SM, Zhao L. A Synergistic, Cultivator Model of De Novo Gene Origination. Genome Biol Evol 2024; 16:evae103. [PMID: 38748819 PMCID: PMC11152449 DOI: 10.1093/gbe/evae103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2024] [Indexed: 06/07/2024] Open
Abstract
The origin and fixation of evolutionarily young genes is a fundamental question in evolutionary biology. However, understanding the origins of newly evolved genes arising de novo from noncoding genomic sequences is challenging. This is partly due to the low likelihood that several neutral or nearly neutral mutations fix prior to the appearance of an important novel molecular function. This issue is particularly exacerbated in large effective population sizes where the effect of drift is small. To address this problem, we propose a regulation-focused, cultivator model for de novo gene evolution. This cultivator-focused model posits that each step in a novel variant's evolutionary trajectory is driven by well-defined, selectively advantageous functions for the cultivator genes, rather than solely by the de novo genes, emphasizing the critical role of genome organization in the evolution of new genes.
Collapse
Affiliation(s)
- UnJin Lee
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, USA
| | - Shawn M Mozeika
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, USA
| | - Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, USA
| |
Collapse
|
2
|
Carpinteyro-Ponce J, Machado CA. The Complex Landscape of Structural Divergence Between the Drosophila pseudoobscura and D. persimilis Genomes. Genome Biol Evol 2024; 16:evae047. [PMID: 38482945 PMCID: PMC10980976 DOI: 10.1093/gbe/evae047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2024] [Indexed: 04/01/2024] Open
Abstract
Structural genomic variants are key drivers of phenotypic evolution. They can span hundreds to millions of base pairs and can thus affect large numbers of genetic elements. Although structural variation is quite common within and between species, its characterization depends upon the quality of genome assemblies and the proportion of repetitive elements. Using new high-quality genome assemblies, we report a complex and previously hidden landscape of structural divergence between the genomes of Drosophila persimilis and D. pseudoobscura, two classic species in speciation research, and study the relationships among structural variants, transposable elements, and gene expression divergence. The new assemblies confirm the already known fixed inversion differences between these species. Consistent with previous studies showing higher levels of nucleotide divergence between fixed inversions relative to collinear regions of the genome, we also find a significant overrepresentation of INDELs inside the inversions. We find that transposable elements accumulate in regions with low levels of recombination, and spatial correlation analyses reveal a strong association between transposable elements and structural variants. We also report a strong association between differentially expressed (DE) genes and structural variants and an overrepresentation of DE genes inside the fixed chromosomal inversions that separate this species pair. Interestingly, species-specific structural variants are overrepresented in DE genes involved in neural development, spermatogenesis, and oocyte-to-embryo transition. Overall, our results highlight the association of transposable elements with structural variants and their importance in driving evolutionary divergence.
Collapse
Affiliation(s)
| | - Carlos A Machado
- Department of Biology, University of Maryland, College Park, MD, USA
| |
Collapse
|
3
|
Noncoding RNA Regulation of Hormonal and Metabolic Systems in the Fruit Fly Drosophila. Metabolites 2023; 13:metabo13020152. [PMID: 36837772 PMCID: PMC9967906 DOI: 10.3390/metabo13020152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 01/12/2023] [Accepted: 01/16/2023] [Indexed: 01/22/2023] Open
Abstract
The importance of RNAs is commonly recognised thanks to protein-coding RNAs, whereas non-coding RNAs (ncRNAs) were conventionally regarded as 'junk'. In the last decade, ncRNAs' significance and roles are becoming noticeable in various biological activities, including those in hormonal and metabolic regulation. Among the ncRNAs: microRNA (miRNA) is a small RNA transcript with ~20 nucleotides in length; long non-coding RNA (lncRNA) is an RNA transcript with >200 nucleotides; and circular RNA (circRNA) is derived from back-splicing of pre-mRNA. These ncRNAs can regulate gene expression levels at epigenetic, transcriptional, and post-transcriptional levels through various mechanisms in insects. A better understanding of these crucial regulators is essential to both basic and applied entomology. In this review, we intend to summarise and discuss the current understanding and knowledge of miRNA, lncRNA, and circRNA in the best-studied insect model, the fruit fly Drosophila.
Collapse
|
4
|
Wiberg RAW, Veltsos P, Snook RR, Ritchie MG. Experimental evolution supports signatures of sexual selection in genomic divergence. Evol Lett 2021; 5:214-229. [PMID: 34136270 PMCID: PMC8190450 DOI: 10.1002/evl3.220] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 02/13/2021] [Accepted: 02/14/2021] [Indexed: 12/13/2022] Open
Abstract
Comparative genomics has contributed to the growing evidence that sexual selection is an important component of evolutionary divergence and speciation. Divergence by sexual selection is implicated in faster rates of divergence of the X chromosome and of genes thought to underlie sexually selected traits, including genes that are sex biased in expression. However, accurately inferring the relative importance of complex and interacting forms of natural selection, demography, and neutral processes that occurred in the evolutionary past is challenging. Experimental evolution provides an opportunity to apply controlled treatments for multiple generations and examine the consequent genomic divergence. Here, we altered sexual selection intensity, elevating sexual selection in polyandrous lines and eliminating it in monogamous lines, and examined patterns of allele frequency divergence in the genome of Drosophila pseudoobscura after more than 160 generations of experimental evolution. Divergence is not uniform across the genome but concentrated in "islands," many of which contain candidate genes implicated in mating behaviors and other sexually selected phenotypes. These are more often seen on the X chromosome, which also shows greater divergence in F ST than neutral expectations. There are characteristic signatures of selection seen in these regions, with lower diversity on the X chromosome than the autosomes, and differences in diversity on the autosomes between selection regimes. Reduced Tajima's D within some of the divergent regions may imply that selective sweeps have occurred, despite considerable recombination. These changes are associated with both differential gene expression between the lines and sex-biased gene expression within the lines. Our results are very similar to those thought to implicate sexual selection in divergence between species and natural populations, and hence provide experimental support for the likely role of sexual selection in driving such types of genetic divergence, but also illustrate how variable outcomes can be for different genomic regions.
Collapse
Affiliation(s)
- R. Axel W. Wiberg
- Centre for Biological DiversityUniversity of St AndrewsSt AndrewsKY16 9THUnited Kingdom
- Current Address: Department of Environmental SciencesZoological InstituteUniversity of BaselBaselCH‐4051Switzerland
| | - Paris Veltsos
- Department of Ecology and Evolutionary BiologyUniversity of KansasLawrenceKansas66045
| | - Rhonda R. Snook
- Department of ZoologyStockholm UniversityStockholm106 91Sweden
| | - Michael G. Ritchie
- Centre for Biological DiversityUniversity of St AndrewsSt AndrewsKY16 9THUnited Kingdom
| |
Collapse
|
5
|
Genome-wide detection and sequence conservation analysis of long non-coding RNA during hair follicle cycle of yak. BMC Genomics 2020; 21:681. [PMID: 32998696 PMCID: PMC7528256 DOI: 10.1186/s12864-020-07082-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 09/18/2020] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Long non-coding RNA (lncRNA) as an important regulator has been demonstrated playing an indispensable role in the biological process of hair follicles (HFs) growth. However, their function and expression profile in the HFs cycle of yak are yet unknown. Only a few functional lncRNAs have been identified, partly due to the low sequence conservation and lack of identified conserved properties in lncRNAs. Here, lncRNA-seq was employed to detect the expression profile of lncRNAs during the HFs cycle of yak, and the sequence conservation of two datasets between yak and cashmere goat during the HFs cycle was analyzed. RESULTS A total of 2884 lncRNAs were identified in 5 phases (Jan., Mar., Jun., Aug., and Oct.) during the HFs cycle of yak. Then, differential expression analysis between 3 phases (Jan., Mar., and Oct.) was performed, revealing that 198 differentially expressed lncRNAs (DELs) were obtained in the Oct.-vs-Jan. group, 280 DELs were obtained in the Jan.-vs-Mar. group, and 340 DELs were obtained in the Mar.-vs-Oct. group. Subsequently, the nearest genes of lncRNAs were searched as the potential target genes and used to explore the function of DELs by GO and KEGG enrichment analysis. Several critical pathways involved in HFs development such as Wnt signaling pathway, VEGF signaling pathway, and signaling pathways regulating pluripotency of stem cells, were enriched. To further screen key lncRNAs influencing the HFs cycle, 24 DELs with differ degree of sequence conservation were obtained via a comparative analysis of partial DELs with previously published lncRNA-seq data of cashmere goat in the HFs cycle using NCBI BLAST-2.9.0+, and 3 DELs of them were randomly selected for further detailed analysis of the sequence conservation properties. CONCLUSIONS This study revealed the expression pattern and potential function of lncRNAs during HFs cycle of yak, which would expand the knowledge about the role of lncRNAs in the HFs cycle. The findings related to sequence conservation properties of lncRNAs in the HFs cycle between the two species may provide valuable insights into the study of lncRNA functionality and mechanism.
Collapse
|
6
|
Chang ZX, Ajayi OE, Guo DY, Wu QF. Genome-wide characterization and developmental expression profiling of long non-coding RNAs in Sogatella furcifera. INSECT SCIENCE 2020; 27:987-997. [PMID: 31264303 DOI: 10.1111/1744-7917.12707] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 05/16/2019] [Accepted: 06/13/2019] [Indexed: 06/09/2023]
Abstract
The genome-wide characterization of long non-coding RNA (lncRNA) in insects demonstrates their importance in fundamental biological processes. Essentially, an in-depth understanding of the functional repertoire of lncRNA in insects is pivotal to insect resources utilization and sustainable pest control. Using a custom bioinformatics pipeline, we identified 1861 lncRNAs encoded by 1852 loci in the Sogatella furcifera genome. We profiled lncRNA expression in different developmental stages and observed that the expression of lncRNAs is more highly temporally restricted compared to protein-coding genes. More up-regulated Sogatella furcifera lncRNA expressed in the embryo, 4th and 5th instars, suggesting that increased lncRNA levels may play a role in these developmental stages. We compared the relationship between the expression of Sogatella furcifera lncRNA and its nearest protein gene and found that lncRNAs were more correlated to their downstream coding neighbors on the opposite strand. Our genome-wide profiling of lncRNAs in Sogatella furcifera identifies exciting candidates for characterization of lncRNAs, and also provides information on lncRNA regulation during insect development.
Collapse
Affiliation(s)
- Zhao-Xia Chang
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Sciences, University of Science and Technology of China, Hefei, China
| | - Olugbenga Emmanuel Ajayi
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Sciences, University of Science and Technology of China, Hefei, China
| | - Dong-Yang Guo
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Sciences, University of Science and Technology of China, Hefei, China
| | - Qing-Fa Wu
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Sciences, University of Science and Technology of China, Hefei, China
- CAS Key Laboratory of Innate Immunity and Chronic Disease, University of Science and Technology of China, Hefei, China
| |
Collapse
|
7
|
Huo D, Sun L, Storey KB, Zhang L, Liu S, Sun J, Yang H. The regulation mechanism of lncRNAs and mRNAs in sea cucumbers under global climate changes: Defense against thermal and hypoxic stresses. THE SCIENCE OF THE TOTAL ENVIRONMENT 2020; 709:136045. [PMID: 31905562 PMCID: PMC7144348 DOI: 10.1016/j.scitotenv.2019.136045] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 11/18/2019] [Accepted: 12/08/2019] [Indexed: 06/10/2023]
Abstract
The aquatic environment can be greatly impacted by thermal and hypoxic stresses, particularly caused by intensified global warming. Hence, there is an urgency to understand the response mechanisms of marine organisms to adverse environment. Although long non-coding RNAs (lncRNAs) are involved in many biological processes, their roles in stress responses still remain unclear. Here, differentially expressed (DE) lncRNAs and mRNAs were identified as responses to environmental stresses in the economically important sea cucumber, Apostichopus japonicus, and their potential roles were explored. Based on a total of 159, 355 and 495 significantly upregulated genes and 230, 518 and 647 significantly downregulated genes identified in the thermal, hypoxic and combination thermal + hypoxic stress treatments, respectively, we constructed DE-lncRNA-mRNA coexpression networks. Among the networks, eight shared pairs were identified from the three treatments, and based on the connectivity degree, MSTRG.27265, MSTRG.19729 and MSTRG.95524 were shown to be crucial lncRNAs. Among all the significantly changed lncRNAs identified by RT-qPCR and sequencing data, binding sites were found in four other lncRNAs (MSTRG.34610, MSTRG.10941, MSTRG.81281 and MSTRG.93731) with Aja-miR-2013-3p, a key miRNA that responds to hypoxia in sea cucumbers. The hypoxia-inducible factor (HIF-1α) was also shown as the possible targeted mRNA of Aja-miR-2013-3p. As indicated by a dual-luciferase reporter assay system, "HIF-1α gene/Aja-miR-2013-3p/MSTRG.34610" network and the "HIF-1α gene/Aja-miR-2013-3p/MSTRG.10941" network may play important roles in sea cucumbers under environmental stresses. Moreover, environmental stress altered the expression of multiple lncRNAs and mRNAs, thus affecting various biological processes in A. japonicus, including immunity, energy metabolism and the cell cycle. At the molecular level, more comprehensive responses were elicited by the combined thermal/hypoxic stress treatment than by individual stresses alone in sea cucumbers. This study lays the groundwork for future research on molecular mechanisms of echinoderm responses to thermal and hypoxic stress in the context of global climate changes.
Collapse
Affiliation(s)
- Da Huo
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China; Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China; University of Chinese Academy of Sciences, Beijing 100049, China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao 266071, China
| | - Lina Sun
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China; Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China; University of Chinese Academy of Sciences, Beijing 100049, China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao 266071, China.
| | - Kenneth B Storey
- Department of Biology, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - Libin Zhang
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China; Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China; University of Chinese Academy of Sciences, Beijing 100049, China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao 266071, China
| | - Shilin Liu
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China; Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China; University of Chinese Academy of Sciences, Beijing 100049, China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao 266071, China
| | - Jingchun Sun
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China; Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China; University of Chinese Academy of Sciences, Beijing 100049, China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao 266071, China
| | - Hongsheng Yang
- CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China; Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China; University of Chinese Academy of Sciences, Beijing 100049, China; Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao 266071, China.
| |
Collapse
|
8
|
Guan R, Li H, Zhang H, An S. Comparative analysis of dsRNA-induced lncRNAs in three kinds of insect species. ARCHIVES OF INSECT BIOCHEMISTRY AND PHYSIOLOGY 2020; 103:e21640. [PMID: 31667893 DOI: 10.1002/arch.21640] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 09/24/2019] [Accepted: 10/10/2019] [Indexed: 06/10/2023]
Abstract
Long noncoding RNAs (lncRNAs) that have immune responses to various stimuli have been identified in some insects. One type of pathogen-associated molecular pattern, double-stranded RNA (dsRNA), can trigger the RNA interference (RNAi) pathway and immune response. Interestingly, there has been no research into characterizing the relationship between lncRNA and dsRNA-induced RNAi pathways. In this study, dsRNA-induced lncRNAs were investigated in two species of lepidopteran insects, Helicoverpa armigera and Plutella xylostella, and one species of coleopteran insects, Tribolium castaneum. Between untreated group and dsRNA-induced group; 3,463 H. armigera, 6,245 P. xylostella, and 3,067 T. castaneum differentially expressed lncRNAs were identified while 156 H. armigera, 247 P. xylostella, 415 T. castaneum lncRNAs and their putative target genes showed consistent changes in gene expression. In T. castaneum, most target genes of the differentially expressed lncRNAs are enriched in the cyclic adenosine monophosphate signaling pathway, ABC transporters, and Janus kinase-signal transducers and activators of the transcription signaling pathway. Conversely, in H. armigera and P. xylostella, the differentially expressed lncRNAs were mainly enriched in the metabolic, digestive, and synthetic signaling pathways. This result indicates that dsRNA-induced lncRNA is species-dependent. We also found that both Dicer-2 and the lncRNA that targets Dicer-2 were significantly upregulated after dsRNA treatment in P. xylostella, indicating that some lncRNAs may be involved in the regulation of the core RNAi pathway in insects. Our results are the first to identify a relationship between lncRNAs and dsRNA in various insect species with different RNAi efficiencies. These results provide a reference for future study of the dsRNA-induced RNAi pathway and different RNAi efficiencies among insect species.
Collapse
Affiliation(s)
- Ruobing Guan
- State Key Laboratory of Wheat and Maize Crop Science, College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| | - Haichao Li
- State Key Laboratory of Wheat and Maize Crop Science, College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| | - Hao Zhang
- Laboratory of Biochemistry and Molecular Biology, School of Marine Sciences, Ningbo University, Ningbo, China
| | - Shiheng An
- State Key Laboratory of Wheat and Maize Crop Science, College of Plant Protection, Henan Agricultural University, Zhengzhou, China
| |
Collapse
|
9
|
Crysnanto D, Obbard DJ. Widespread gene duplication and adaptive evolution in the RNA interference pathways of the Drosophila obscura group. BMC Evol Biol 2019; 19:99. [PMID: 31068148 PMCID: PMC6505081 DOI: 10.1186/s12862-019-1425-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 04/18/2019] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND RNA interference (RNAi) related pathways provide defense against viruses and transposable elements, and have been implicated in the suppression of meiotic drive elements. Genes in these pathways often exhibit high levels of adaptive substitution, and over longer timescales show gene duplication and loss-most likely as a consequence of their role in mediating conflict with these parasites. This is particularly striking for Argonaute 2 (Ago2), which is ancestrally the key effector of antiviral RNAi in insects, but has repeatedly formed new testis-specific duplicates in the recent history of the obscura species-group of Drosophila. RESULTS Here we take advantage of publicly available genomic and transcriptomic data to identify six further RNAi-pathway genes that have duplicated in this clade of Drosophila, and examine their evolutionary history. As seen for Ago2, we observe high levels of adaptive amino-acid substitution and changes in sex-biased expression in many of the paralogs. However, our phylogenetic analysis suggests that co-duplications of the RNAi machinery were not synchronous, and our expression analysis fails to identify consistent male-specific expression. CONCLUSIONS These results confirm that RNAi genes, including genes of the antiviral and piRNA pathways, have undergone multiple independent duplications and that their history has been particularly labile within the obscura group. However, they also suggest that the selective pressures driving these changes have not been consistent, implying that more than one selective agent may be responsible.
Collapse
Affiliation(s)
- Danang Crysnanto
- Institute of Evolutionary Biology, University of Edinburgh, Charlotte Auerbach Road, Edinburgh, UK
- Animal Genomics, ETH Zurich, Zurich, Switzerland
| | - Darren J. Obbard
- Institute of Evolutionary Biology, University of Edinburgh, Charlotte Auerbach Road, Edinburgh, UK
- Centre for Infection, Evolution and Immunity, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
10
|
Pegueroles C, Iraola-Guzmán S, Chorostecki U, Ksiezopolska E, Saus E, Gabaldón T. Transcriptomic analyses reveal groups of co-expressed, syntenic lncRNAs in four species of the genus Caenorhabditis. RNA Biol 2019; 16:320-329. [PMID: 30691342 PMCID: PMC6380332 DOI: 10.1080/15476286.2019.1572438] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 12/18/2018] [Accepted: 01/13/2019] [Indexed: 01/24/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) are a heterogeneous class of genes that do not code for proteins. Since lncRNAs (or a fraction thereof) are expected to be functional, many efforts have been dedicated to catalog lncRNAs in numerous organisms, but our knowledge of lncRNAs in non vertebrate species remains very limited. Here, we annotated lncRNAs using transcriptomic data from the same larval stage of four Caenorhabditis species. The number of annotated lncRNAs in self-fertile nematodes was lower than in out-crossing species. We used a combination of approaches to identify putatively homologous lncRNAs: synteny, sequence conservation, and structural conservation. We classified a total of 1,532 out of 7,635 genes from the four species into families of lncRNAs with conserved synteny and expression at the larval stage, suggesting that a large fraction of the predicted lncRNAs may be species specific. Despite both sequence and local secondary structure seem to be poorly conserved, sequences within families frequently shared BLASTn hits and short sequence motifs, which were more likely to be unpaired in the predicted structures. We provide the first multi-species catalog of lncRNAs in nematodes and identify groups of lncRNAs with conserved synteny and expression, that share exposed motifs.
Collapse
Affiliation(s)
- Cinta Pegueroles
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Susana Iraola-Guzmán
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Uciel Chorostecki
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Ewa Ksiezopolska
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Ester Saus
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | - Toni Gabaldón
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| |
Collapse
|
11
|
Mat-Sharani S, Firdaus-Raih M. Computational discovery and annotation of conserved small open reading frames in fungal genomes. BMC Bioinformatics 2019; 19:551. [PMID: 30717662 PMCID: PMC7394265 DOI: 10.1186/s12859-018-2550-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 11/30/2018] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Small open reading frames (smORF/sORFs) that encode short protein sequences are often overlooked during the standard gene prediction process thus leading to many sORFs being left undiscovered and/or misannotated. For many genomes, a second round of sORF targeted gene prediction can complement the existing annotation. In this study, we specifically targeted the identification of ORFs encoding for 80 amino acid residues or less from 31 fungal genomes. We then compared the predicted sORFs and analysed those that are highly conserved among the genomes. RESULTS A first set of sORFs was identified from existing annotations that fitted the maximum of 80 residues criterion. A second set was predicted using parameters that specifically searched for ORF candidates of 80 codons or less in the exonic, intronic and intergenic sequences of the subject genomes. A total of 1986 conserved sORFs were predicted and characterized. CONCLUSIONS It is evident that numerous open reading frames that could potentially encode for polypeptides consisting of 80 amino acid residues or less are overlooked during standard gene prediction and annotation. From our results, additional targeted reannotation of genomes is clearly able to complement standard genome annotation to identify sORFs. Due to the lack of, and limitations with experimental validation, we propose that a simple conservation analysis can provide an acceptable means of ensuring that the predicted sORFs are sufficiently clear of gene prediction artefacts.
Collapse
Affiliation(s)
- Shuhaila Mat-Sharani
- Centre for Frontier Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.,Malaysia Genome Institute, Ministry of Science, Technology & Innovation, Jalan Bangi, 43000, Kajang, Selangor, Malaysia
| | - Mohd Firdaus-Raih
- Centre for Frontier Sciences, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia. .,Institute of Systems Biology, Universiti Kebangsaan Malaysia, UKM, 43600, Bangi, Selangor, Malaysia.
| |
Collapse
|
12
|
Kirsch R, Seemann SE, Ruzzo WL, Cohen SM, Stadler PF, Gorodkin J. Identification and characterization of novel conserved RNA structures in Drosophila. BMC Genomics 2018; 19:899. [PMID: 30537930 PMCID: PMC6288889 DOI: 10.1186/s12864-018-5234-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Accepted: 11/08/2018] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome. RESULTS To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, most statistically significant enrichment of CRSs were observed in pupae, mainly in exons of untranslated regions, promotors, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein coding genes, which suggests that most intergenic CRSs are independent genetic units. CONCLUSIONS This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.
Collapse
Affiliation(s)
- Rebecca Kirsch
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, D-04107 Germany
| | - Stefan E. Seemann
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
| | - Walter L. Ruzzo
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- School of Computer Science and Engineering, University of Washington, Box 352350, Seattle, 98195-2350 WA USA
- Department of Genome Sciences, University of Washington, Box 355065, Seattle, 98195-5065 WA USA
- Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, 98109-1024 WA USA
| | - Stephen M. Cohen
- Department of Cellular and Molecular Medicine, University of Copenhagen, Blegdamsvej 3, Copenhagen N, DK-2200 Denmark
| | - Peter F. Stadler
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Bioinformatics Group, Department of Computer Science, and Interdisciplinary Center for Bioinformatics, Universität Leipzig, Härtelstraße 16–18, Leipzig, D-04107 Germany
- Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, Leipzig, D-04103 Germany
- Faculdad de Ciencias, Universidad Nacional de Colombia, Sede Bogotá, Ciudad Universitaria, Bogotá, COL-111321 D.C. Colombia
- Department of Theoretical Chemistry, University of Vienna, Währinger Straße 17, Vienna, A-1090 Austria
- Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM87501 USA
| | - Jan Gorodkin
- Center for non-coding RNA in Technology and Health, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
- Department of Veterinary and Animal Science, University of Copenhagen, Grønnegårdsvej 3, Frederiksberg C, DK-1870 Denmark
| |
Collapse
|
13
|
Schor IE, Bussotti G, Maleš M, Forneris M, Viales RR, Enright AJ, Furlong EEM. Non-coding RNA Expression, Function, and Variation during Drosophila Embryogenesis. Curr Biol 2018; 28:3547-3561.e9. [PMID: 30393032 PMCID: PMC6264527 DOI: 10.1016/j.cub.2018.09.026] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 07/10/2018] [Accepted: 09/12/2018] [Indexed: 01/02/2023]
Abstract
Long non-coding RNAs (lncRNAs) can often function in the regulation of gene expression during development; however, their generality as essential regulators in developmental processes and organismal phenotypes remains unclear. Here, we performed a tailored investigation of lncRNA expression and function during Drosophila embryogenesis, interrogating multiple stages, tissue specificity, nuclear localization, and genetic backgrounds. Our results almost double the number of annotated lncRNAs expressed at these embryonic stages. lncRNA levels are generally positively correlated with those of their neighboring genes, with little evidence of transcriptional interference. Using fluorescent in situ hybridization, we report the spatiotemporal expression of 15 new lncRNAs, revealing very dynamic tissue-specific patterns. Despite this, deletion of selected lncRNA genes had no obvious developmental defects or effects on viability under standard and stressed conditions. However, two lncRNA deletions resulted in modest expression changes of a small number of genes, suggesting that they fine-tune expression of non-essential genes. Several lncRNAs have strain-specific expression, indicating that they are not fixed within the population. This intra-species variation across genetic backgrounds may thereby be a useful tool to distinguish rapidly evolving lncRNAs with as yet non-essential roles.
Collapse
Affiliation(s)
- Ignacio E Schor
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany; Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Departamento de Fisiología, Biología Molecular y Celular, FCEyN, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Giovanni Bussotti
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
| | - Matilda Maleš
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | - Mattia Forneris
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | - Rebecca R Viales
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | - Anton J Enright
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
| | - Eileen E M Furlong
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany.
| |
Collapse
|
14
|
Sun W, Feng J. Differential lncRNA expression profiles reveal the potential roles of lncRNAs in antiviral immune response of Crassostrea gigas. FISH & SHELLFISH IMMUNOLOGY 2018; 81:233-241. [PMID: 30010017 DOI: 10.1016/j.fsi.2018.07.032] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 07/10/2018] [Accepted: 07/12/2018] [Indexed: 06/08/2023]
Abstract
Long noncoding RNAs (lncRNAs) may play widespread roles in various biological processes. However, systematic profiles of lncRNAs in the biological responses of Pacific Oyster (Crassostrea gigas) to pathogen infection have not yet been demonstrated. Here, we have conducted an exhaustive comparative transcriptome analysis using a bioinformatics approach to exam the functions of lncRNAs response to Ostreid herpesvirus 1μVar (OsHV-1μVar) challenge. In total, 101 differentially expressed lncRNAs (DE-lncRNA) during OsHV-1μVar infections were identified. Compared with differentially expressed mRNAs (DE-mRNA), DE-lncRNAs are shorter in terms of overall length but longer in terms of exon length. These lncRNAs shared similar characteristics with previously reported invertebrate lncRNAs, such as relatively low GC content, low exon number and low sequence conservation, but low expression level were not observed. 20 DE-lncRNAs are typically co-expressed with their neighboring genes annotated as GO terms (GO: 0044237), indicating that these lncRNAs are involved in binding and cellular process functions in cis mode. The weighted gene co-expression network (WGCNA) analysis resulted in 15 modules. The highlighted blue module was specifically demonstrated a co-expression relationship between 14 DE-lncRNAs and 17 immune-related DE-mRNAs (IR-DE-mRNA). Three hub lncRNAs within this module were co-expressed with one hub IR-DE-mRNA involved in fibrinogen-related protein. It was speculated that lncRNAs is extensively involved in oyster antiviral innate immune system. The present study will facilitate subsequently experimental studies to unravel the function of lncRNAs in marine invertebrate response to pathogen infection.
Collapse
Affiliation(s)
- Weiming Sun
- Ocean School, Yantai University, Yantai 264005, China.
| | - Jixing Feng
- Ocean School, Yantai University, Yantai 264005, China
| |
Collapse
|
15
|
Developmental Dynamics of Long Noncoding RNA Expression during Sexual Fruiting Body Formation in Fusarium graminearum. mBio 2018; 9:mBio.01292-18. [PMID: 30108170 PMCID: PMC6094484 DOI: 10.1128/mbio.01292-18] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Long noncoding RNA (lncRNA) plays important roles in sexual development in eukaryotes. In filamentous fungi, however, little is known about the expression and roles of lncRNAs during fruiting body formation. By profiling developmental transcriptomes during the life cycle of the plant-pathogenic fungus Fusarium graminearum, we identified 547 lncRNAs whose expression was highly dynamic, with about 40% peaking at the meiotic stage. Many lncRNAs were found to be antisense to mRNAs, forming 300 sense-antisense pairs. Although small RNAs were produced from these overlapping loci, antisense lncRNAs appeared not to be involved in gene silencing pathways. Genome-wide analysis of small RNA clusters identified many silenced loci at the meiotic stage. However, we found transcriptionally active small RNA clusters, many of which were associated with lncRNAs. Also, we observed that many antisense lncRNAs and their respective sense transcripts were induced in parallel as the fruiting bodies matured. The nonsense-mediated decay (NMD) pathway is known to determine the fates of lncRNAs as well as mRNAs. Thus, we analyzed mutants defective in NMD and identified a subset of lncRNAs that were induced during sexual development but suppressed by NMD during vegetative growth. These results highlight the developmental stage-specific nature and functional potential of lncRNA expression in shaping the fungal fruiting bodies and provide fundamental resources for studying sexual stage-induced lncRNAs. Fusarium graminearum is the causal agent of the head blight on our major staple crops, wheat and corn. The fruiting body formation on the host plants is indispensable for the disease cycle and epidemics. Long noncoding RNA (lncRNA) molecules are emerging as key regulatory components for sexual development in animals and plants. To date, however, there is a paucity of information on the roles of lncRNAs in fungal fruiting body formation. Here we characterized hundreds of lncRNAs that exhibited developmental stage-specific expression patterns during fruiting body formation. Also, we discovered that many lncRNAs were induced in parallel with their overlapping transcripts on the opposite DNA strand during sexual development. Finally, we found a subset of lncRNAs that were regulated by an RNA surveillance system during vegetative growth. This research provides fundamental genomic resources that will spur further investigations on lncRNAs that may play important roles in shaping fungal fruiting bodies.
Collapse
|
16
|
Li X, Ao J, Wu J. Systematic identification and comparison of expressed profiles of lncRNAs and circRNAs with associated co-expression and ceRNA networks in mouse germline stem cells. Oncotarget 2018; 8:26573-26590. [PMID: 28404936 PMCID: PMC5432280 DOI: 10.18632/oncotarget.15719] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2016] [Accepted: 01/22/2017] [Indexed: 01/01/2023] Open
Abstract
Accumulating evidence indicates that long noncoding RNAs (lncRNAs) and circular RNAs (circRNAs) involve in germ cell development. However, little is known about the functions and mechanisms of lncRNAs and circRNAs in self-renewal and differentiation of germline stem cells. Therefore, we explored the expression profiles of mRNAs, lncRNAs, and circRNAs in male and female mouse germline stem cells by high-throughput sequencing. We identified 18573 novel lncRNAs and 18822 circRNAs in the germline stem cells and further confirmed the existence of these lncRNAs and circRNAs by RT-PCR. The results showed that male and female germline stem cells had similar GDNF signaling mechanism. Subsequently, 8115 mRNAs, 3996 lncRNAs, and 921 circRNAs exhibited sex-biased expression that may be associated with germline stem cell acquisition of the sex-specific properties required for differentiation into gametes. Gene Ontology (GO) and KEGG pathway enrichment analyses revealed different functions for these sex-biased lncRNAs and circRNAs. We further constructed correlated expression networks including coding–noncoding co-expression and competing endogenous RNAs with bioinformatics. Co-expression analysis showed hundreds of lncRNAs were correlated with sex differences in mouse germline stem cells, including lncRNA Gm11851, lncRNA Gm12840, lncRNA 4930405O22Rik, and lncRNA Atp10d. CeRNA network inferred that lncRNA Meg3 and cirRNA Igf1r could bind competitively with miRNA-15a-5p increasing target gene Inha, Acsl3, Kif21b, and Igfbp2 expressions. These findings provide novel perspectives on lncRNAs and circRNAs and lay a foundation for future research into the regulating mechanisms of lncRNAs and circRNAs in germline stem cells.
Collapse
Affiliation(s)
- Xiaoyong Li
- Renji Hospital, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Junping Ao
- State Key Laboratory of Oncogenes and Related Genes, Shanghai Cancer Institute, Renji Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200032, China
| | - Ji Wu
- Renji Hospital, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders (Ministry of Education), Bio-X Institutes, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200240, China.,Key Laboratory of Fertility Preservation and Maintenance of Ministry of Education, Ningxia Medical University, Yinchuan, 750004, China.,Shanghai Key Laboratory of Reproduction Medicine, Shanghai, 200025, China
| |
Collapse
|
17
|
Lakhotia SC. From Heterochromatin to Long Noncoding RNAs in Drosophila: Expanding the Arena of Gene Function and Regulation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1008:75-118. [PMID: 28815537 DOI: 10.1007/978-981-10-5203-3_3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Recent years have witnessed a remarkable interest in exploring the significance of pervasive noncoding transcripts in diverse eukaryotes. Classical cytogenetic studies using the Drosophila model system unraveled the perplexing attributes and "functions" of the "gene"-poor heterochromatin. Recent molecular studies in the fly model are likewise revealing the very diverse and significant roles played by long noncoding RNAs (lncRNAs) in development, gene regulation, chromatin organization, cell and nuclear architecture, etc. There has been a rapid increase in the number of identified lncRNAs, although a much larger number still remains unknown. The diversity of modes of actions and functions of the limited number of Drosophila lncRNAs, which have been examined, already reflects the profound roles of such RNAs in generating and sustaining the biological complexities of eukaryotes. Several of the known Drosophila lncRNAs originate as independent sense or antisense transcripts from promoter or intergenic, intronic, or 5'/3'-UTR regions, while many of them are independent genes that produce only lncRNAs or coding as well as noncoding RNAs. The different lncRNAs affect chromatin organization (local or large-scale pan-chromosomal), transcription, RNA processing/stability, or translation either directly through interaction with their target DNA sequences or indirectly by acting as intermediary molecules for specific regulatory proteins or may act as decoys/sinks, or storage sites for specific proteins or groups of proteins, or may provide a structural framework for the assembly of substructures in nucleus/cytoplasm. It is interesting that many of the "functions" alluded to heterochromatin in earlier cytogenetic studies appear to find correlates with the known subtle as well as far-reaching actions of the different small and long noncoding RNAs. Further studies exploiting the very rich and powerful genetic and molecular resources available for the Drosophila model are expected to unravel the mystery underlying the long reach of ncRNAs.
Collapse
Affiliation(s)
- Subhash C Lakhotia
- Cytogenetics Laboratory, Department of Zoology, Banaras Hindu University, Varanasi, 221005, India.
| |
Collapse
|
18
|
Liu F, Guo D, Yuan Z, Chen C, Xiao H. Genome-wide identification of long non-coding RNA genes and their association with insecticide resistance and metamorphosis in diamondback moth, Plutella xylostella. Sci Rep 2017; 7:15870. [PMID: 29158595 PMCID: PMC5696462 DOI: 10.1038/s41598-017-16057-2] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Accepted: 11/06/2017] [Indexed: 01/17/2023] Open
Abstract
Long non-coding RNA (lncRNA) is a class of noncoding RNA >200 bp in length that has essential roles in regulating a variety of biological processes. Here, we constructed a computational pipeline to identify lncRNA genes in the diamondback moth (Plutella xylostella), a major insect pest of cruciferous vegetables. In total, 3,324 lncRNAs corresponding to 2,475 loci were identified from 13 RNA-Seq datasets, including samples from parasitized, insecticide-resistant strains and different developmental stages. The identified P. xylostella lncRNAs had shorter transcripts and fewer exons than protein-coding genes. Seven out of nine randomly selected lncRNAs were validated by strand-specific RT-PCR. In total, 54-172 lncRNAs were specifically expressed in the insecticide resistant strains, among which one lncRNA was located adjacent to the sodium channel gene. In addition, 63-135 lncRNAs were specifically expressed in different developmental stages, among which three lncRNAs overlapped or were located adjacent to the metamorphosis-associated genes. These lncRNAs were either strongly or weakly co-expressed with their overlapping or neighboring mRNA genes. In summary, we identified thousands of lncRNAs and presented evidence that lncRNAs might have key roles in conferring insecticide resistance and regulating the metamorphosis development in P. xylostella.
Collapse
Affiliation(s)
- Feiling Liu
- College of Life Sciences and Resource Environment, Yichun University, Yichun, 336000, China
| | - Dianhao Guo
- Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing, 210095, China
| | - Zhuting Yuan
- Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing, 210095, China
| | - Chen Chen
- College of Life Sciences and Resource Environment, Yichun University, Yichun, 336000, China
| | - Huamei Xiao
- College of Life Sciences and Resource Environment, Yichun University, Yichun, 336000, China.
- The Center for Translational Medicine, Yichun University, Yichun, 336000, China.
| |
Collapse
|
19
|
Dianatpour A, Ghafouri-Fard S. Long Non Coding RNA Expression Intersecting Cancer and Spermatogenesis: A Systematic Review. Asian Pac J Cancer Prev 2017; 18:2601-2610. [PMID: 29072050 PMCID: PMC5747377 DOI: 10.22034/apjcp.2017.18.10.2601] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Background: Numerous similarities have been noted between gametogenic and tumorigenic programs in features
such as global hypomethylation, immune evasion, immortalization, meiosis induction, and migration. In addition, aberrant
expression of testis specific genes has been detected in various cancers which has led to categorization of these genes
as “cancer-testis genes”. Most of the examples identified in this category are protein encoding. However, recent studies
have revealed that non-coding RNAs, including long non coding RNAs (lncRNAs), may have essential regulatory
roles in telomere biology, chromatin dynamics, modulation of gene expression and genome structural organization.
All of these functions are implicated in both gametogenic and tumorigenic programs. Methods: In the present study,
we conducted a computerized search of the MEDLINE/PUBMED and Embase databases with the key words lncRNA,
gametogenesis, testis and cancer. Results: We found a number of lncRNAs with essential roles and notable expression
in both gametogenic and cancer tissues. Conclusions: Comparison between cancer tissues and gametogenic tissues
has shown that numerous lncRNAs are expressed in both, playing similar roles in processes modulated by signaling
pathways such as Wnt/β-catenin and PI3K/AKT/mTOR. Evaluation of expression patterns and functions of these
genes should pave the way to discovery of biomarkers for early detection, prognostic assessment and evaluation of
therapeutic responses in cancers.
Collapse
Affiliation(s)
- Ali Dianatpour
- Department of Medical Genetics, Faculty of Medicine, Shahid Beheshti University of Medical sciences, Tehran, Iran.
| | | |
Collapse
|
20
|
Identification of long non-coding RNA in the horse transcriptome. BMC Genomics 2017; 18:511. [PMID: 28676104 PMCID: PMC5496257 DOI: 10.1186/s12864-017-3884-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Accepted: 06/20/2017] [Indexed: 12/31/2022] Open
Abstract
Background Efforts to resolve the transcribed sequences in the equine genome have focused on protein-coding RNA. The transcription of the intergenic regions, although detected via total RNA sequencing (RNA-seq), has yet to be characterized in the horse. The most recent equine transcriptome based on RNA-seq from several tissues was a prime opportunity to obtain a concurrent long non-coding RNA (lncRNA) database. Results This lncRNA database has a breadth of eight tissues and a depth of over 20 million reads for select tissues, providing the deepest and most expansive equine lncRNA database. Utilizing the intergenic reads and three categories of novel genes from a previously published equine transcriptome pipeline, we better describe these groups by annotating the lncRNA candidates. These lncRNA candidates were filtered using an approach adapted from human lncRNA annotation, which removes transcripts based on size, expression, protein-coding capability and distance to the start or stop of annotated protein-coding transcripts. Conclusion Our equine lncRNA database has 20,800 transcripts that demonstrate characteristics unique to lncRNA including low expression, low exon diversity and low levels of sequence conservation. These candidate lncRNA will serve as a baseline lncRNA annotation and begin to describe the RNA-seq reads assigned to the intergenic space in the horse. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3884-2) contains supplementary material, which is available to authorized users.
Collapse
|
21
|
Lopez-Ezquerra A, Harrison MC, Bornberg-Bauer E. Comparative analysis of lincRNA in insect species. BMC Evol Biol 2017; 17:155. [PMID: 28673235 PMCID: PMC5494802 DOI: 10.1186/s12862-017-0985-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Accepted: 06/02/2017] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND The ever increasing availability of genomes makes it possible to investigate and compare not only the genomic complements of genes and proteins, but also of RNAs. One class of RNAs, the long noncoding RNAs (lncRNAs) and, in particular, their subclass of long intergenic noncoding RNAs (lincRNAs) have recently gained much attention because of their roles in regulation of important biological processes such as immune response or cell differentiation and as possible evolutionary precursors for protein coding genes. lincRNAs seem to be poorly conserved at the sequence level but at least some lincRNAs have conserved structural elements and syntenic genomic positions. Previous studies showed that transposable elements are a main contribution to the evolution of lincRNAs in mammals. In contrast, plant lincRNA emergence and evolution has been linked with local duplication events. However, little is known about their evolutionary dynamics in general and in insect genomes in particular. RESULTS Here we compared lincRNAs between seven insect genomes and investigated possible evolutionary changes and functional roles. We find very low sequence conservation between different species and that similarities within a species are mostly due to their association with transposable elements (TE) and simple repeats. Furthermore, we find that TEs are less frequent in lincRNA exons than in their introns, indicating that TEs may have been removed by selection. When we analysed the predicted thermodynamic stabilities of lincRNAs we found that they are more stable than their randomized controls which might indicate some selection pressure to maintain certain structural elements. We list several of the most stable lincRNAs which could serve as prime candidates for future functional studies. We also discuss the possibility of de novo protein coding genes emerging from lincRNAs. This is because lincRNAs with high GC content and potentially with longer open reading frames (ORF) are candidate loci where de novo gene emergence might occur. CONCLUSION The processes responsible for the emergence and diversification of lincRNAs in insects remain unclear. Both duplication and transposable elements may be important for the creation of new lincRNAs in insects.
Collapse
Affiliation(s)
- Alberto Lopez-Ezquerra
- Institute of Evolution and Biodiversity, University of Münster, Hüfferstrasse,1, Münster, Münster, Germany
| | - Mark C Harrison
- Institute of Evolution and Biodiversity, University of Münster, Hüfferstrasse,1, Münster, Münster, Germany
| | - Erich Bornberg-Bauer
- Institute of Evolution and Biodiversity, University of Münster, Hüfferstrasse,1, Münster, Münster, Germany.
| |
Collapse
|