1
|
Xiao L, Fang L, Zhong W, Kool ET. RNA infrastructure profiling illuminates transcriptome structure in crowded spaces. Cell Chem Biol 2024:S2451-9456(24)00405-7. [PMID: 39447577 DOI: 10.1016/j.chembiol.2024.09.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 06/17/2024] [Accepted: 09/26/2024] [Indexed: 10/26/2024]
Abstract
RNAs fold into compact structures and undergo protein interactions in cells. These occluded environments can block reagents that probe the underlying RNAs. Probes that can analyze structure in crowded settings can shed light on RNA biology. Here, we employ 2'-OH-reactive probes that are small enough to access folded RNA structure underlying close molecular contacts within cells, providing considerably broader coverage for intracellular RNA structural analysis. The data are analyzed first with well-characterized human ribosomal RNAs and then applied transcriptome-wide to polyadenylated transcripts. The smallest probe acetylimidazole (AcIm) yields 80% greater structural coverage than larger conventional reagent NAIN3, providing enhanced structural information in hundreds of transcripts. The acetyl probe also provides superior signals for identifying m6A modification sites in transcripts, particularly in sites that are inaccessible to a standard probe. Our strategy enables profiling RNA infrastructure, enhancing analysis of transcriptome structure, modification, and intracellular interactions, especially in spatially crowded settings.
Collapse
Affiliation(s)
- Lu Xiao
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Linglan Fang
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Wenrui Zhong
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Eric T Kool
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA; Sarafan ChEM-H Institute, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
2
|
Sebutsoe XM, Tsotetsi NJN, Jantjies ZE, Raphela-Choma PP, Choene MS, Motadi LR. Therapeutic Strategies in Advanced Cervical Cancer Detection, Prevention and Treatment. Onco Targets Ther 2024; 17:785-801. [PMID: 39345275 PMCID: PMC11439348 DOI: 10.2147/ott.s475132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 08/08/2024] [Indexed: 10/01/2024] Open
Abstract
Cervical cancer is ranked the fourth most common cause of cancer related deaths amongst women. The situation is particularly dire in low to lower middle-income countries. It continues to affect these countries due to poor vaccine coverage and screening. Cervical cancer is mostly detected in the advanced stages leading to poor outcomes. This review focuses on the progress made to date to improve early detection and targeted therapy using both circulating RNA. Vaccine has played a major role in cervical cancer control in vaccinated young woman in mainly developed countries yet in low-income countries with challenges of 3 dose vaccination affordability, cervical cancer continues to be the second most deadly amongst women. In this review, we show the progress made in reducing cervical cancer using vaccination that in combination with other treatments that might improve survival in cervical cancer. We further show with both miRNA and siRNA that targeted therapy and specific markers might be ideal for early detection of cervical cancer in low-income countries. These markers are either upregulated or down regulated in cancer providing clue to the stage of the cancer.
Collapse
Affiliation(s)
- Xolisiwe M Sebutsoe
- Department of Biochemistry C2 Lab, University of Johannesburg, Auckland Park Kingsway Campus, Johannesburg, South Africa
| | | | - Zodwa Edith Jantjies
- Department of Biochemistry C2 Lab, University of Johannesburg, Auckland Park Kingsway Campus, Johannesburg, South Africa
| | - Portia Pheladi Raphela-Choma
- Department of Biochemistry C2 Lab, University of Johannesburg, Auckland Park Kingsway Campus, Johannesburg, South Africa
| | - Mpho S Choene
- Department of Biochemistry C2 Lab, University of Johannesburg, Auckland Park Kingsway Campus, Johannesburg, South Africa
| | - Lesetja R Motadi
- Department of Biochemistry C2 Lab, University of Johannesburg, Auckland Park Kingsway Campus, Johannesburg, South Africa
| |
Collapse
|
3
|
Sha CM, Wang J, Dokholyan NV. Predicting 3D RNA structure from the nucleotide sequence using Euclidean neural networks. Biophys J 2024; 123:2671-2681. [PMID: 37838833 PMCID: PMC11393712 DOI: 10.1016/j.bpj.2023.10.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 09/19/2023] [Accepted: 10/12/2023] [Indexed: 10/16/2023] Open
Abstract
Fast and accurate 3D RNA structure prediction remains a major challenge in structural biology, mostly due to the size and flexibility of RNA molecules, as well as the lack of diverse experimentally determined structures of RNA molecules. Unlike DNA structure, RNA structure is far less constrained by basepair hydrogen bonding, resulting in an explosion of potential stable states. Here, we propose a convolutional neural network that predicts all pairwise distances between residues in an RNA, using a recently described smooth parametrization of Euclidean distance matrices. We achieve high-accuracy predictions on RNAs up to 100 nt in length in fractions of a second, a factor of 107 faster than existing molecular dynamics-based methods. We also convert our coarse-grained machine learning output into an all-atom model using discrete molecular dynamics with constraints. Our proposed computational pipeline predicts all-atom RNA models solely from the nucleotide sequence. However, this method suffers from the same limitation as nucleic acid molecular dynamics: the scarcity of available RNA crystal structures for training.
Collapse
Affiliation(s)
- Congzhou M Sha
- Department of Engineering Science and Mechanics, Penn State University, State College, Pennsylvania; Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania
| | - Jian Wang
- Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania
| | - Nikolay V Dokholyan
- Department of Engineering Science and Mechanics, Penn State University, State College, Pennsylvania; Department of Pharmacology, Penn State College of Medicine, Hershey, Pennsylvania; Department of Biochemistry and Molecular Biology, Penn State College of Medicine, Hershey, Pennsylvania; Department of Chemistry, Penn State University, State College, Pennsylvania; Department of Biomedical Engineering, Penn State University, State College, Pennsylvania.
| |
Collapse
|
4
|
Shelton WJ, Zandpazandi S, Nix JS, Gokden M, Bauer M, Ryan KR, Wardell CP, Vaske OM, Rodriguez A. Long-read sequencing for brain tumors. Front Oncol 2024; 14:1395985. [PMID: 38915364 PMCID: PMC11194609 DOI: 10.3389/fonc.2024.1395985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 05/27/2024] [Indexed: 06/26/2024] Open
Abstract
Brain tumors and genomics have a long-standing history given that glioblastoma was the first cancer studied by the cancer genome atlas. The numerous and continuous advances through the decades in sequencing technologies have aided in the advanced molecular characterization of brain tumors for diagnosis, prognosis, and treatment. Since the implementation of molecular biomarkers by the WHO CNS in 2016, the genomics of brain tumors has been integrated into diagnostic criteria. Long-read sequencing, also known as third generation sequencing, is an emerging technique that allows for the sequencing of longer DNA segments leading to improved detection of structural variants and epigenetics. These capabilities are opening a way for better characterization of brain tumors. Here, we present a comprehensive summary of the state of the art of third-generation sequencing in the application for brain tumor diagnosis, prognosis, and treatment. We discuss the advantages and potential new implementations of long-read sequencing into clinical paradigms for neuro-oncology patients.
Collapse
Affiliation(s)
- William J Shelton
- Department of Neurosurgery, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Sara Zandpazandi
- Department of Neurosurgery, Medical University of South Carolina, Charleston, SC, United States
| | - J Stephen Nix
- Department of Pathology, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Murat Gokden
- Department of Pathology, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Michael Bauer
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Katie Rose Ryan
- Department of Biochemistry and Molecular Biology, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Christopher P Wardell
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Olena Morozova Vaske
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA, United States
| | - Analiz Rodriguez
- Department of Neurosurgery, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| |
Collapse
|
5
|
Xiao L, Fang L, Kool ET. 2'-OH as a universal handle for studying intracellular RNAs. Cell Chem Biol 2024; 31:110-124. [PMID: 37992716 PMCID: PMC10841764 DOI: 10.1016/j.chembiol.2023.10.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 09/28/2023] [Accepted: 10/29/2023] [Indexed: 11/24/2023]
Abstract
RNA plays pivotal roles in most cellular processes, serving as both the traditional carrier of genetic information and as a key regulator of cellular functions. The advent of chemical technologies has contributed critically to the analysis of cellular RNA structures, functions, and interactions. Many of these methods and molecules involve the utilization of chemically reactive handles in RNAs, either introduced externally or inherent within the polymer itself. Among these handles, the 2'-hydroxyl (2'-OH) group has emerged as an exceptionally well-suited and general chemical moiety for the modification and profiling of RNAs in intracellular studies. In this review, we provide an overview of the recent advancements in intracellular applications of acylation at the 2'-OH group of RNA. We outline progress made in probing RNA structure and interactomes, controlling RNA function, RNA imaging, and analyzing RNA-small molecule interactions, all achieved in living cells through this simple chemical handle on the biopolymer.
Collapse
Affiliation(s)
- Lu Xiao
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Linglan Fang
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA
| | - Eric T Kool
- Department of Chemistry, Stanford University, Stanford, CA 94305, USA; Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
6
|
Xiao L, Fang L, Kool ET. RNA Infrastructure Profiling Illuminates Transcriptome Structure in Crowded Spaces. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561413. [PMID: 37873487 PMCID: PMC10592667 DOI: 10.1101/2023.10.09.561413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
RNAs can fold into compact three-dimensional structures, and most RNAs undergo protein interactions in the cell. These compact and occluded environments can block the ability of structure-probing agents to provide useful data about the folding and modification of the underlying RNA. The development of probes that can analyze structure in crowded settings, and differentiate the proximity of interactions, can shed new light on RNA biology. To this end, here we employ 2'-OH-reactive probes that are small enough to access folded RNA structure underlying many close molecular contacts within cells, providing considerably broader coverage for intracellular RNA structural analysis. We compare reverse transcriptase stops in RNA-Seq data from probes of small and standard size to assess RNA-protein proximity and evaluate solvent-exposed tunnels adjacent to RNA. The data are analyzed first with structurally characterized complexes (human 18S and 28S RNA), and then applied transcriptome-wide to polyadenylated transcripts in HEK293 cells. In our transcriptome profile, the smallest probe acetylimidazole (AcIm) yields 80% greater structural coverage than larger conventional reagent NAIN3, providing enhanced structural information in hundreds of transcripts. We further show that acetyl probes provide superior signals for identifying m6A modification sites in transcripts, and provide information regarding methylation sites that are inaccessible to a larger standard probe. RNA infrastructure profiling (RISP) enables enhanced analysis of transcriptome structure, modification, and interactions in living cells, especially in spatially crowded settings.
Collapse
Affiliation(s)
- Lu Xiao
- Department of Chemistry, Stanford University, Stanford, CA 94305, United States
| | - Linglan Fang
- Department of Chemistry, Stanford University, Stanford, CA 94305, United States
| | - Eric T. Kool
- Department of Chemistry, Stanford University, Stanford, CA 94305, United States
- Sarafan ChEM-H Institute, Stanford University, Stanford, CA 94305, United States
| |
Collapse
|
7
|
Wang F, Cao H, Xia Q, Liu Z, Wang M, Gao F, Xu D, Deng B, Diao Y, Kapranov P. Lessons from discovery of true ADAR RNA editing sites in a human cell line. BMC Biol 2023; 21:160. [PMID: 37468903 PMCID: PMC10357658 DOI: 10.1186/s12915-023-01651-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 06/20/2023] [Indexed: 07/21/2023] Open
Abstract
BACKGROUND Conversion or editing of adenosine (A) into inosine (I) catalyzed by specialized cellular enzymes represents one of the most common post-transcriptional RNA modifications with emerging connection to disease. A-to-I conversions can happen at specific sites and lead to increase in proteome diversity and changes in RNA stability, splicing, and regulation. Such sites can be detected as adenine-to-guanine sequence changes by next-generation RNA sequencing which resulted in millions reported sites from multiple genome-wide surveys. Nonetheless, the lack of extensive independent validation in such endeavors, which is critical considering the relatively high error rate of next-generation sequencing, leads to lingering questions about the validity of the current compendiums of the editing sites and conclusions based on them. RESULTS Strikingly, we found that the current analytical methods suffer from very high false positive rates and that a significant fraction of sites in the public databases cannot be validated. In this work, we present potential solutions to these problems and provide a comprehensive and extensively validated list of A-to-I editing sites in a human cancer cell line. Our findings demonstrate that most of true A-to-I editing sites in a human cancer cell line are located in the non-coding transcripts, the so-called RNA 'dark matter'. On the other hand, many ADAR editing events occurring in exons of human protein-coding mRNAs, including those that can recode the transcriptome, represent false positives and need to be interpreted with caution. Nonetheless, yet undiscovered authentic ADAR sites that increase the diversity of human proteome exist and warrant further identification. CONCLUSIONS Accurate identification of human ADAR sites remains a challenging problem, particularly for the sites in exons of protein-coding mRNAs. As a result, genome-wide surveys of ADAR editome must still be accompanied by extensive Sanger validation efforts. However, given the vast number of unknown human ADAR sites, there is a need for further developments of the analytical techniques, potentially those that are based on deep learning solutions, in order to provide a quick and reliable identification of the editome in any sample.
Collapse
Affiliation(s)
- Fang Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Huifen Cao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
| | - Qiu Xia
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Ziheng Liu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Ming Wang
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Fan Gao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Dongyang Xu
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Bolin Deng
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Yong Diao
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China
| | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen, 361021, China.
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Xiamen University, Xiamen, 361102, China.
| |
Collapse
|
8
|
Xu H, Li C, Xu C, Zhang J. Chance promoter activities illuminate the origins of eukaryotic intergenic transcriptions. Nat Commun 2023; 14:1826. [PMID: 37005399 PMCID: PMC10067814 DOI: 10.1038/s41467-023-37610-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 03/23/2023] [Indexed: 04/04/2023] Open
Abstract
It is debated whether the pervasive intergenic transcription from eukaryotic genomes has functional significance or simply reflects the promiscuity of RNA polymerases. We approach this question by comparing chance promoter activities with the expression levels of intergenic regions in the model eukaryote Saccharomyces cerevisiae. We build a library of over 105 strains, each carrying a 120-nucleotide, chromosomally integrated, completely random sequence driving the potential transcription of a barcode. Quantifying the RNA concentration of each barcode in two environments reveals that 41-63% of random sequences have significant, albeit usually low, promoter activities. Therefore, even in eukaryotes, where the presence of chromatin is thought to repress transcription, chance transcription is prevalent. We find that only 1-5% of yeast intergenic transcriptions are unattributable to chance promoter activities or neighboring gene expressions, and these transcriptions exhibit higher-than-expected environment-specificity. These findings suggest that only a minute fraction of intergenic transcription is functional in yeast.
Collapse
Affiliation(s)
- Haiqing Xu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Chuan Li
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Microsoft, Redmond, WA, USA
| | - Chuan Xu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
- Bio-X Institutes, Shanghai Jiao Tong University, Shanghai, China
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
9
|
Tsakiroglou M, Evans A, Pirmohamed M. Leveraging transcriptomics for precision diagnosis: Lessons learned from cancer and sepsis. Front Genet 2023; 14:1100352. [PMID: 36968610 PMCID: PMC10036914 DOI: 10.3389/fgene.2023.1100352] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 02/20/2023] [Indexed: 03/12/2023] Open
Abstract
Diagnostics require precision and predictive ability to be clinically useful. Integration of multi-omic with clinical data is crucial to our understanding of disease pathogenesis and diagnosis. However, interpretation of overwhelming amounts of information at the individual level requires sophisticated computational tools for extraction of clinically meaningful outputs. Moreover, evolution of technical and analytical methods often outpaces standardisation strategies. RNA is the most dynamic component of all -omics technologies carrying an abundance of regulatory information that is least harnessed for use in clinical diagnostics. Gene expression-based tests capture genetic and non-genetic heterogeneity and have been implemented in certain diseases. For example patients with early breast cancer are spared toxic unnecessary treatments with scores based on the expression of a set of genes (e.g., Oncotype DX). The ability of transcriptomics to portray the transcriptional status at a moment in time has also been used in diagnosis of dynamic diseases such as sepsis. Gene expression profiles identify endotypes in sepsis patients with prognostic value and a potential to discriminate between viral and bacterial infection. The application of transcriptomics for patient stratification in clinical environments and clinical trials thus holds promise. In this review, we discuss the current clinical application in the fields of cancer and infection. We use these paradigms to highlight the impediments in identifying useful diagnostic and prognostic biomarkers and propose approaches to overcome them and aid efforts towards clinical implementation.
Collapse
Affiliation(s)
- Maria Tsakiroglou
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
- *Correspondence: Maria Tsakiroglou,
| | - Anthony Evans
- Computational Biology Facility, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Munir Pirmohamed
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
10
|
Gao F, Wang F, Cao H, Chen Y, Diao Y, Kapranov P. Evidence for Existence of Multiple Functional Human Small RNAs Derived from Transcripts of Protein-Coding Genes. Int J Mol Sci 2023; 24:4163. [PMID: 36835575 PMCID: PMC9959880 DOI: 10.3390/ijms24044163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 02/10/2023] [Accepted: 02/15/2023] [Indexed: 02/22/2023] Open
Abstract
The human genome encodes a multitude of different noncoding transcripts that have been traditionally separated on the basis of their lengths into long (>200 nt) or small (<200 nt) noncoding RNAs. The functions, mechanisms of action, and biological relevance of the vast majority of both long and short noncoding transcripts remain unknown. However, according to the functional understanding of the known classes of long and small noncoding RNAs (sncRNAs) that have been shown to play crucial roles in multiple biological processes, it is generally assumed that many unannotated long and small transcripts participate in important cellular functions as well. Nevertheless, direct evidence of functionality is lacking for most noncoding transcripts, especially for sncRNAs that are often dismissed as stable degradation products of longer RNAs. Here, we developed a high-throughput assay to test the functionality of sncRNAs by overexpressing them in human cells. Surprisingly, we found that a significant fraction (>40%) of unannotated sncRNAs appear to have biological relevance. Furthermore, contrary to the expectation, the potentially functional transcripts are not highly abundant and can be derived from protein-coding mRNAs. These results strongly suggest that the small noncoding transcriptome can harbor multiple functional transcripts that warrant future studies.
Collapse
Affiliation(s)
| | | | | | | | | | - Philipp Kapranov
- Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen 361021, China
| |
Collapse
|
11
|
Cao H, Zhang Y, Cai Y, Tang L, Gao F, Xu D, Kapranov P. Hotspots of single-strand DNA “breakome” are enriched at transcriptional start sites of genes. Front Mol Biosci 2022; 9:895795. [PMID: 36046604 PMCID: PMC9420937 DOI: 10.3389/fmolb.2022.895795] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Accepted: 06/27/2022] [Indexed: 01/01/2023] Open
Abstract
Single-strand breaks (SSBs) represent one of the most common types of DNA damage, yet not much is known about the genome landscapes of this type of DNA lesions in mammalian cells. Here, we found that SSBs are more likely to occur in certain positions of the human genome—SSB hotspots—in different cells of the same cell type and in different cell types. We hypothesize that the hotspots are likely to represent biologically relevant breaks. Furthermore, we found that the hotspots had a prominent tendency to be enriched in the immediate vicinity of transcriptional start sites (TSSs). We show that these hotspots are not likely to represent technical artifacts or be caused by common mechanisms previously found to cause DNA cleavage at promoters, such as apoptotic DNA fragmentation or topoisomerase type II (TOP2) activity. Therefore, such TSS-associated hotspots could potentially be generated using a novel mechanism that could involve preferential cleavage at cytosines, and their existence is consistent with recent studies suggesting a complex relationship between DNA damage and regulation of gene expression.
Collapse
|
12
|
Roy S, Ganguly N, Banerjee S. Exploring clinical implications and role of non-coding RNAs in lung carcinogenesis. Mol Biol Rep 2022; 49:6871-6883. [PMID: 35076850 DOI: 10.1007/s11033-022-07159-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 01/18/2022] [Indexed: 12/12/2022]
Abstract
Lung cancer is the utmost familiar category of cancer with greatest fatality rate worldwide and several regulatory mechanisms exercise cellular control on critical oncogenic trails implicated in lung associated carcinogenesis. The non-coding RNAs (ncRNAs) are shown to play a variety of regulatory roles, including stimulating cell proliferation, inhibiting programmed cell death, enhancing cancer cell metastatic ability and acquiring resistance to drugs. Furthermore, ncRNAs exhibit tissue-specific expression as well as great stability in bodily fluids. As a consequence, they are strong contenders for cancer based theragnostics. microRNA (miRNA) alters gene expression primarily by either degrading or interfering with the translation of targeted mRNA and long non-coding RNAs (lncRNAs) can influence gene expression by targeting transcriptional activators or repressors, RNA polymers and even DNA-duplex. lncRNAs are typically found to be dysregulated in lung cancer and hence targeting ncRNAs could be a viable strategy for developing potential therapies as well as for overcoming chemoresistance in lung cancer. The purpose of this review is to elucidate the role of ncRNAs, revisiting the recent studies in lung cancer.
Collapse
Affiliation(s)
- Swagata Roy
- School of Bioscience and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, 632 014, India
| | - Neeldeep Ganguly
- School of Bioscience and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, 632 014, India
| | - Satarupa Banerjee
- School of Bioscience and Technology, Vellore Institute of Technology, Vellore, Tamil Nadu, 632 014, India.
| |
Collapse
|
13
|
Akhlaghpour H. An RNA-Based Theory of Natural Universal Computation. J Theor Biol 2021; 537:110984. [PMID: 34979104 DOI: 10.1016/j.jtbi.2021.110984] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Revised: 09/30/2021] [Accepted: 12/07/2021] [Indexed: 12/15/2022]
Abstract
Life is confronted with computation problems in a variety of domains including animal behavior, single-cell behavior, and embryonic development. Yet we currently do not know of a naturally existing biological system that is capable of universal computation, i.e., Turing-equivalent in scope. Generic finite-dimensional dynamical systems (which encompass most models of neural networks, intracellular signaling cascades, and gene regulatory networks) fall short of universal computation, but are assumed to be capable of explaining cognition and development. I present a class of models that bridge two concepts from distant fields: combinatory logic (or, equivalently, lambda calculus) and RNA molecular biology. A set of basic RNA editing rules can make it possible to compute any computable function with identical algorithmic complexity to that of Turing machines. The models do not assume extraordinarily complex molecular machinery or any processes that radically differ from what we already know to occur in cells. Distinct independent enzymes can mediate each of the rules and RNA molecules solve the problem of parenthesis matching through their secondary structure. In the most plausible of these models all of the editing rules can be implemented with merely cleavage and ligation operations at fixed positions relative to predefined motifs. This demonstrates that universal computation is well within the reach of molecular biology. It is therefore reasonable to assume that life has evolved - or possibly began with - a universal computer that yet remains to be discovered. The variety of seemingly unrelated computational problems across many scales can potentially be solved using the same RNA-based computation system. Experimental validation of this theory may immensely impact our understanding of memory, cognition, development, disease, evolution, and the early stages of life.
Collapse
Affiliation(s)
- Hessameddin Akhlaghpour
- Laboratory of Integrative Brain Function, The Rockefeller University, New York, NY, 10065, USA
| |
Collapse
|
14
|
Jing Y, Jiang X, Lei L, Peng M, Ren J, Xiao Q, Tao Y, Tao Y, Huang J, Wang L, Tang Y, Yang Z, Yang Z, Zhang L. Mutant NPM1-regulated lncRNA HOTAIRM1 promotes leukemia cell autophagy and proliferation by targeting EGR1 and ULK3. JOURNAL OF EXPERIMENTAL & CLINICAL CANCER RESEARCH : CR 2021; 40:312. [PMID: 34615546 PMCID: PMC8493742 DOI: 10.1186/s13046-021-02122-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 09/28/2021] [Indexed: 12/18/2022]
Abstract
Background Acute myeloid leukemia (AML) with mutated nucleophosmin (NPM1), which displays a distinct long noncoding RNA (lncRNA) expression profile, has been defined as a unique subgroup in the new classification of myeloid neoplasms. However, the biological roles of key lncRNAs in the development of NPM1-mutated AML are currently unclear. Here, we aimed to investigate the functional and mechanistic roles of the lncRNA HOTAIRM1 in NPM1-mutated AML. Methods The expression of HOTAIRM1 was analyzed with a public database and further determined by qRT-PCR in NPM1-mutated AML samples and cell lines. The cause of upregulated HOTAIRM1 expression was investigated by luciferase reporter, chromatin immunoprecipitation and ubiquitination assays. The functional role of HOTAIRM1 in autophagy and proliferation was evaluated using western blot analysis, immunofluorescence staining, a Cell Counting Kit-8 (CCK-8) assay, a 5-ethynyl-2′-deoxyuridine (EdU) incorporation assay, flow cytometric analyses and animal studies. The action mechanism of HOTAIRM1 was explored through RNA fluorescence in situ hybridization, RNA pulldown and RNA immunoprecipitation assays. Results HOTAIRM1 was highly expressed in NPM1-mutated AML. High HOTAIRM1 expression was induced in part by mutant NPM1 via KLF5-dependent transcriptional regulation. Importantly, HOTAIRM1 promoted autophagy and proliferation both in vitro and in vivo. Mechanistic investigations demonstrated that nuclear HOTAIRM1 promoted EGR1 degradation by serving as a scaffold to facilitate MDM2-EGR1 complex formation, while cytoplasmic HOTAIRM1 acted as a sponge for miR-152-3p to increase ULK3 expression. Conclusions Taken together, our findings identify two oncogenic regulatory axes in NPM1-mutated AML centered on HOTAIRM1: one involving EGR1 and MDM2 in the nucleus and the other involving the miR-152-3p/ULK3 axis in the cytoplasm. Our study indicates that HOTAIRM1 may be a promising therapeutic target for this distinct leukemia subtype. Supplementary Information The online version contains supplementary material available at 10.1186/s13046-021-02122-2.
Collapse
Affiliation(s)
- Yipei Jing
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Xueke Jiang
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Li Lei
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Meixi Peng
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Jun Ren
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Qiaoling Xiao
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Yao Tao
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Yonghong Tao
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Junpeng Huang
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Lu Wang
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Yuting Tang
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China
| | - Zailin Yang
- Department of Clinical Laboratory, The Third Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Zesong Yang
- Department of Hematology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China
| | - Ling Zhang
- Key Laboratory of Laboratory Medical Diagnostics Designated by the Ministry of Education, School of Laboratory Medicine, Chongqing Medical University, No.1, Yixueyuan Road, Chongqing, 400016, China.
| |
Collapse
|
15
|
Le P, Romano G, Nana-Sinkam P, Acunzo M. Non-Coding RNAs in Cancer Diagnosis and Therapy: Focus on Lung Cancer. Cancers (Basel) 2021; 13:cancers13061372. [PMID: 33803619 PMCID: PMC8003033 DOI: 10.3390/cancers13061372] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 03/03/2021] [Accepted: 03/11/2021] [Indexed: 02/06/2023] Open
Abstract
Over the last several decades, clinical evaluation and treatment of lung cancers have largely improved with the classification of genetic drivers of the disease, such as EGFR, ALK, and ROS1. There are numerous regulatory factors that exert cellular control over key oncogenic pathways involved in lung cancers. In particular, non-coding RNAs (ncRNAs) have a diversity of regulatory roles in lung cancers such that they have been shown to be involved in inducing proliferation, suppressing apoptotic pathways, increasing metastatic potential of cancer cells, and acquiring drug resistance. The dysregulation of various ncRNAs in human cancers has prompted preclinical studies examining the therapeutic potential of restoring and/or inhibiting these ncRNAs. Furthermore, ncRNAs demonstrate tissue-specific expression in addition to high stability within biological fluids. This makes them excellent candidates as cancer biomarkers. This review aims to discuss the relevance of ncRNAs in cancer pathology, diagnosis, and therapy, with a focus on lung cancer.
Collapse
|
16
|
Gourvest M, De Clara E, Wu HC, Touriol C, Meggetto F, De Thé H, Pyronnet S, Brousset P, Bousquet M. A novel leukemic route of mutant NPM1 through nuclear import of the overexpressed long noncoding RNA LONA. Leukemia 2021; 35:2784-2798. [PMID: 34131282 PMCID: PMC8205207 DOI: 10.1038/s41375-021-01307-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 05/11/2021] [Accepted: 05/21/2021] [Indexed: 02/06/2023]
Abstract
The most frequent genetic alteration in acute myeloid leukemia (AML) is the mutation of nucleophosmin 1 (NPM1). Yet, its downstream oncogenic routes are not fully understood. Here, we report the identification of one long noncoding RNA (lncRNA) overexpressed in NPM1-mutated AML patients (named LONA) whose intracellular localization inversely reflects that of NPM1. While NPM1 is nuclear and LONA cytoplasmic in wild-type NPM1 AML cells, LONA becomes nuclear as mutant NPM1 moves toward the cytoplasm. Gain or loss of function combined with a genome-wide RNA-seq search identified a set of LONA mRNA targets encoding proteins involved in myeloid cell differentiation (including THSB1, MAFB, and ASB2) and interaction with its microenvironment. Consistently, LONA overexpression in mutant NPM1 established cell lines and primary AML cells exerts an anti-myeloid differentiation effect, whilst it exerts an opposite pro-myeloid differentiation effect in a wild type NPM1 setting. In vivo, LONA overexpression acts as an oncogenic lncRNA reducing the survival of mice transplanted with AML cells and rendering AML tumors more resistant to AraC chemotherapy.These data indicate that mutation-dependent nuclear export of NPM1 leads to nuclear retention and consequent oncogenic functions of the overexpressed lncRNA LONA, thus uncovering a novel NPM1 mutation-dependent pathway in AML pathogenesis.
Collapse
Affiliation(s)
- Morgane Gourvest
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| | - Etienne De Clara
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| | - Hsin-Chieh Wu
- grid.410533.00000 0001 2179 2236INSERM U944 and 1050, IRSL, University of Paris and PSL, Hôpital St. Louis and Collège de France, Paris, France
| | - Christian Touriol
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| | - Fabienne Meggetto
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| | - Hugues De Thé
- grid.410533.00000 0001 2179 2236INSERM U944 and 1050, IRSL, University of Paris and PSL, Hôpital St. Louis and Collège de France, Paris, France
| | - Stéphane Pyronnet
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| | - Pierre Brousset
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| | - Marina Bousquet
- grid.468186.5Cancer Research Center of Toulouse (CRCT), UMR1037 Inserm/Université Toulouse III Paul Sabatier, ERL5294 CNRS, Laboratoire d’excellence Toulouse Cancer (TOUCAN), Toulouse, France
| |
Collapse
|
17
|
Puntambekar S, Newhouse R, San-Miguel J, Chauhan R, Vernaz G, Willis T, Wayland MT, Umrania Y, Miska EA, Prabakaran S. Evolutionary divergence of novel open reading frames in cichlids speciation. Sci Rep 2020; 10:21570. [PMID: 33299045 PMCID: PMC7726158 DOI: 10.1038/s41598-020-78555-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 11/26/2020] [Indexed: 01/02/2023] Open
Abstract
Novel open reading frames (nORFs) with coding potential may arise from noncoding DNA. Not much is known about their emergence, functional role, fixation in a population or contribution to adaptive radiation. Cichlids fishes exhibit extensive phenotypic diversification and speciation. Encounters with new environments alone are not sufficient to explain this striking diversity of cichlid radiation because other taxa coexistent with the Cichlidae demonstrate lower species richness. Wagner et al. analyzed cichlid diversification in 46 African lakes and reported that both extrinsic environmental factors and intrinsic lineage-specific traits related to sexual selection have strongly influenced the cichlid radiation, which indicates the existence of unknown molecular mechanisms responsible for rapid phenotypic diversification, such as emergence of novel open reading frames (nORFs). In this study, we integrated transcriptomic and proteomic signatures from two tissues of two cichlids species, identified nORFs and performed evolutionary analysis on these nORF regions. Our results suggest that the time scale of speciation of the two species and evolutionary divergence of these nORF genomic regions are similar and indicate a potential role for these nORFs in speciation of the cichlid fishes.
Collapse
Affiliation(s)
- Shraddha Puntambekar
- Department of Biology, Indian Institute of Science Education and Research, Pune, Maharashtra, 411008, India
| | - Rachel Newhouse
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
| | - Jaime San-Miguel
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
| | - Ruchi Chauhan
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
| | - Grégoire Vernaz
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
- The Wellcome Trust/CRUK Gurdon Institute, University of Cambridge, Cambridge, CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, CB10 1SA, UK
| | - Thomas Willis
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
| | - Matthew T Wayland
- Department of Zoology, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
| | - Yagnesh Umrania
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Eric A Miska
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, CB10 1SA, UK
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QR, UK
| | - Sudhakaran Prabakaran
- Department of Biology, Indian Institute of Science Education and Research, Pune, Maharashtra, 411008, India.
- Department of Genetics, University of Cambridge, Downing Site, Cambridge, CB2 3EH, UK.
- St. Edmund's College, University of Cambridge, Cambridge, CB3 0BN, UK.
| |
Collapse
|
18
|
Palazzo AF, Kang YM. GC-content biases in protein-coding genes act as an "mRNA identity" feature for nuclear export. Bioessays 2020; 43:e2000197. [PMID: 33165929 DOI: 10.1002/bies.202000197] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/30/2020] [Accepted: 10/01/2020] [Indexed: 01/11/2023]
Abstract
It has long been observed that human protein-coding genes have a particular distribution of GC-content: the 5' end of these genes has high GC-content while the 3' end has low GC-content. In 2012, it was proposed that this pattern of GC-content could act as an mRNA identity feature that would lead to it being better recognized by the cellular machinery to promote its nuclear export. In contrast, junk RNA, which largely lacks this feature, would be retained in the nucleus and targeted for decay. Now two recent papers have provided evidence that GC-content does promote the nuclear export of many mRNAs in human cells.
Collapse
Affiliation(s)
- Alexander F Palazzo
- Department of Biochemistry, University of Toronto, Toronto, ON, M5G 1M1, Canada
| | - Yoon Mo Kang
- Department of Biochemistry, University of Toronto, Toronto, ON, M5G 1M1, Canada
| |
Collapse
|
19
|
Reply to Verwilt et al.: Experimental evidence against DNA contamination in SILVER-seq. Proc Natl Acad Sci U S A 2020; 117:18937-18938. [PMID: 32788395 PMCID: PMC7431041 DOI: 10.1073/pnas.2008585117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
|
20
|
Pande A, Makalowski W, Brosius J, Raabe CA. Enhancer occlusion transcripts regulate the activity of human enhancer domains via transcriptional interference: a computational perspective. Nucleic Acids Res 2020; 48:3435-3454. [PMID: 32133533 PMCID: PMC7144904 DOI: 10.1093/nar/gkaa026] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2018] [Revised: 12/27/2019] [Accepted: 01/31/2020] [Indexed: 02/05/2023] Open
Abstract
Analysis of ENCODE long RNA-Seq and ChIP-seq (Chromatin Immunoprecipitation Sequencing) datasets for HepG2 and HeLa cell lines uncovered 1647 and 1958 transcripts that interfere with transcription factor binding to human enhancer domains. TFBSs (Transcription Factor Binding Sites) intersected by these 'Enhancer Occlusion Transcripts' (EOTrs) displayed significantly lower relative transcription factor (TF) binding affinities compared to TFBSs for the same TF devoid of EOTrs. Expression of most EOTrs was regulated in a cell line specific manner; analysis for the same TFBSs across cell lines, i.e. in the absence or presence of EOTrs, yielded consistently higher relative TF/DNA-binding affinities for TFBSs devoid of EOTrs. Lower activities of EOTr-associated enhancer domains coincided with reduced occupancy levels for histone tail modifications H3K27ac and H3K9ac. Similarly, the analysis of EOTrs with allele-specific expression identified lower activities for alleles associated with EOTrs. ChIA-PET (Chromatin Interaction Analysis by Paired-End Tag Sequencing) and 5C (Carbon Copy Chromosome Conformation Capture) uncovered that enhancer domains associated with EOTrs preferentially interacted with poised gene promoters. Analysis of EOTr regions with GRO-seq (Global run-on) data established the correlation of RNA polymerase pausing and occlusion of TF-binding. Our results implied that EOTr expression regulates human enhancer domains via transcriptional interference.
Collapse
Affiliation(s)
- Amit Pande
- Institute of Experimental Pathology, Centre for Molecular Biology of Inflammation (ZMBE), University of Münster, Von-Esmarch-Strasse 56, D-48149 Münster, Germany.,Brandenburg Medical School (MHB), Fehrbelliner Strasse 38, D-16816 Neuruppin, Germany.,Institute of Bioinformatics, University of Münster, Niels-Stensen-Strasse 14, D-48149 Münster, Germany
| | - Wojciech Makalowski
- Institute of Bioinformatics, University of Münster, Niels-Stensen-Strasse 14, D-48149 Münster, Germany
| | - Jürgen Brosius
- Institute of Experimental Pathology, Centre for Molecular Biology of Inflammation (ZMBE), University of Münster, Von-Esmarch-Strasse 56, D-48149 Münster, Germany.,Brandenburg Medical School (MHB), Fehrbelliner Strasse 38, D-16816 Neuruppin, Germany.,Institutes for Systems Genetics, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Carsten A Raabe
- Institute of Experimental Pathology, Centre for Molecular Biology of Inflammation (ZMBE), University of Münster, Von-Esmarch-Strasse 56, D-48149 Münster, Germany.,Brandenburg Medical School (MHB), Fehrbelliner Strasse 38, D-16816 Neuruppin, Germany.,Institute of Medical Biochemistry, Centre for Molecular Biology of Inflammation (ZMBE), University of Münster, Von-Esmarch-Strasse 56, D-48149 Münster, Germany
| |
Collapse
|
21
|
Evolution of novel genes in three-spined stickleback populations. Heredity (Edinb) 2020; 125:50-59. [PMID: 32499660 PMCID: PMC7413265 DOI: 10.1038/s41437-020-0319-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Revised: 04/27/2020] [Accepted: 04/30/2020] [Indexed: 12/22/2022] Open
Abstract
Eukaryotic genomes frequently acquire new protein-coding genes which may significantly impact an organism’s fitness. Novel genes can be created, for example, by duplication of large genomic regions or de novo, from previously non-coding DNA. Either way, creation of a novel transcript is an essential early step during novel gene emergence. Most studies on the gain-and-loss dynamics of novel genes so far have compared genomes between species, constraining analyses to genes that have remained fixed over long time scales. However, the importance of novel genes for rapid adaptation among populations has recently been shown. Therefore, since little is known about the evolutionary dynamics of transcripts across natural populations, we here study transcriptomes from several tissues and nine geographically distinct populations of an ecological model species, the three-spined stickleback. Our findings suggest that novel genes typically start out as transcripts with low expression and high tissue specificity. Early expression regulation appears to be mediated by gene-body methylation. Although most new and narrowly expressed genes are rapidly lost, those that survive and subsequently spread through populations tend to gain broader and higher expression levels. The properties of the encoded proteins, such as disorder and aggregation propensity, hardly change. Correspondingly, young novel genes are not preferentially under positive selection but older novel genes more often overlap with FST outlier regions. Taken together, expression of the surviving novel genes is rapidly regulated, probably via epigenetic mechanisms, while structural properties of encoded proteins are non-debilitating and might only change much later.
Collapse
|
22
|
Van der Mude A. Structure encoding in DNA. J Theor Biol 2020; 492:110205. [PMID: 32070719 DOI: 10.1016/j.jtbi.2020.110205] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Revised: 12/29/2019] [Accepted: 02/14/2020] [Indexed: 12/21/2022]
Abstract
It is proposed that transposons and related long non-coding RNA define the fine structure of body parts. Although morphogens have long been known to direct the formation of many gross structures in early embryonic development, they do not have the necessary precision to define a structure down to the individual cellular level. Using the distinction between procedural and declarative knowledge in information processing as an analogy, it is hypothesized that DNA encodes fine structure in a manner that is different from the genetic code for proteins. The hypothesis states that repeated or near-repeated sequences that are in transposons and non-coding RNA define body part structures. As the cells in a body part go through the epigenetic process of differentiation, the action of methylation serves to inactivate all but the relevant structure definitions and some associated cell type genes. The transposons left active will then physically modify the DNA sequence in the heterochromatin to establish the local context in the three-dimensional body part structure. This brings the encoded definition of the cell type to the histone. The histone code for that cell type starts the regulatory cascade that turns on the genes associated with that particular type of cell, transforming it from a multipotent cell to a fully differentiated cell. This mechanism creates structures in the musculoskeletal system, the organs of the body, the major parts of the brain, and other systems.
Collapse
|
23
|
Velema WA, Kool ET. The chemistry and applications of RNA 2'-OH acylation. Nat Rev Chem 2020; 4:22-37. [PMID: 32984545 PMCID: PMC7513686 DOI: 10.1038/s41570-019-0147-6] [Citation(s) in RCA: 45] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/18/2019] [Indexed: 12/19/2022]
Abstract
RNA is a versatile biomolecule with a broad range of biological functions that go far beyond its initially described role as a simple information carrier. The development of chemical methods to control, manipulate and modify RNA has the potential to yield new insights into its many functions and properties. Traditionally, most of these methods involved the chemical modification of RNA structure using solid-state synthesis or enzymatic transformations. However, over the past 15 years, the direct functionalization of RNA by selective acylation of the 2'-hydroxyl (2'-OH) group has emerged as a powerful alternative that enables the simple modification of both synthetic and transcribed RNAs. In this Review, we discuss the chemical properties and design of effective reagents for RNA 2'-OH acylation, highlighting the unique problem of 2'-OH reactivity in the presence of water. We elaborate on how RNA 2'-OH acylation is being exploited to develop selective chemical probes that enable interrogation of RNA structure and function, and describe new developments and applications in the field.
Collapse
Affiliation(s)
| | - Eric T. Kool
- Department of Chemistry, Stanford University, Stanford, CA, USA
| |
Collapse
|
24
|
Lessons learned from a lncRNA odyssey for two genes with vascular functions, DLL4 and TIE1. Vascul Pharmacol 2019; 114:103-109. [PMID: 30910126 DOI: 10.1016/j.vph.2018.06.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 04/24/2018] [Accepted: 06/13/2018] [Indexed: 01/30/2023]
Abstract
Pervasive transcription is a feature of the human genome that requires better understanding. Over the last decade or so, RNA species longer than 200 nucleotides-dubbed long non-coding RNA (lncRNAs)-had been found in sense or anti-sense orientation within or outside of genes that encode proteins. Importantly, lncRNA-mediated gene regulation and the elements that control lncRNA expression are a source of fascination among molecular biologists. In vascular biology, a dozen or so lncRNAs had been identified, and progress occurs each day. In this review, we highlighted our laboratories' contribution to the lncRNA field by discussing lessons learned from two lncRNAs in the tyrosine kinase containing immunoglobulin and epidermal growth factor homology1 (Tie1) and delta-like 4 (Dll4) loci. These genes are responsible for basic vascular patterning and pathophysiological remodeling in angiogenesis.
Collapse
|
25
|
Retrotransposon expression in response to in vitro inoculation with two fungal pathogens of Scots pine (Pinus sylvestris L.). BMC Res Notes 2019; 12:243. [PMID: 31036050 PMCID: PMC6489336 DOI: 10.1186/s13104-019-4275-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Accepted: 04/22/2019] [Indexed: 12/20/2022] Open
Abstract
OBJECTIVE Conifer genomes show high genetic diversity in intergenic regions that contain diverse sets of transposable elements with dominating long terminal repeat (LTR) retrotransposons (RE). Transcription of RE in response to environmental stimuli could produce various types of regulatory non-coding RNAs, but global genomic methylation changes could result in a coincidental expression of normally silent genomic regions. Expression of several RE families was evaluated in Scots pine seedlings after controlled inoculations with two fungal species that exhibit different modes of pathogenicity (necrotrophic and likely biotrophic); data compared to the overall RE distribution in genome. Recognition of regulatory non-coding RNA involved in host-pathogen interplay could be valuable in understanding defence mechanisms of perennial plants. RESULTS In the case of necrotrophic fungi Heterobasidion annosum (HA), short activation followed by restriction of RE expression was revealed after inoculation and during the spread of the pathogen. After inoculation with Lophodermium seditiosum (LS), an early increase in RE expression was revealed with the spread of the pathogen and subsequent transcription rise in all seedlings. Our observations indicate that in the complex plant genome multiple RE families constitutively express in response to pathogen invasion and these sequences could undergo regulation related to host response or pathogen influence.
Collapse
|
26
|
Baek J, Lee B, Kwon S, Yoon S. LncRNAnet: long non-coding RNA identification using deep learning. Bioinformatics 2018; 34:3889-3897. [DOI: 10.1093/bioinformatics/bty418] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 05/22/2018] [Indexed: 02/06/2023] Open
Affiliation(s)
- Junghwan Baek
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea
| | - Byunghan Lee
- Electrical and Computer Engineering, Seoul National University, Seoul, Korea
| | - Sunyoung Kwon
- Electrical and Computer Engineering, Seoul National University, Seoul, Korea
| | - Sungroh Yoon
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Korea
- Electrical and Computer Engineering, Seoul National University, Seoul, Korea
| |
Collapse
|
27
|
Lakhotia SC. From Heterochromatin to Long Noncoding RNAs in Drosophila: Expanding the Arena of Gene Function and Regulation. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1008:75-118. [PMID: 28815537 DOI: 10.1007/978-981-10-5203-3_3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Recent years have witnessed a remarkable interest in exploring the significance of pervasive noncoding transcripts in diverse eukaryotes. Classical cytogenetic studies using the Drosophila model system unraveled the perplexing attributes and "functions" of the "gene"-poor heterochromatin. Recent molecular studies in the fly model are likewise revealing the very diverse and significant roles played by long noncoding RNAs (lncRNAs) in development, gene regulation, chromatin organization, cell and nuclear architecture, etc. There has been a rapid increase in the number of identified lncRNAs, although a much larger number still remains unknown. The diversity of modes of actions and functions of the limited number of Drosophila lncRNAs, which have been examined, already reflects the profound roles of such RNAs in generating and sustaining the biological complexities of eukaryotes. Several of the known Drosophila lncRNAs originate as independent sense or antisense transcripts from promoter or intergenic, intronic, or 5'/3'-UTR regions, while many of them are independent genes that produce only lncRNAs or coding as well as noncoding RNAs. The different lncRNAs affect chromatin organization (local or large-scale pan-chromosomal), transcription, RNA processing/stability, or translation either directly through interaction with their target DNA sequences or indirectly by acting as intermediary molecules for specific regulatory proteins or may act as decoys/sinks, or storage sites for specific proteins or groups of proteins, or may provide a structural framework for the assembly of substructures in nucleus/cytoplasm. It is interesting that many of the "functions" alluded to heterochromatin in earlier cytogenetic studies appear to find correlates with the known subtle as well as far-reaching actions of the different small and long noncoding RNAs. Further studies exploiting the very rich and powerful genetic and molecular resources available for the Drosophila model are expected to unravel the mystery underlying the long reach of ncRNAs.
Collapse
Affiliation(s)
- Subhash C Lakhotia
- Cytogenetics Laboratory, Department of Zoology, Banaras Hindu University, Varanasi, 221005, India.
| |
Collapse
|
28
|
Transcriptional and Post-transcriptional Gene Regulation by Long Non-coding RNA. GENOMICS PROTEOMICS & BIOINFORMATICS 2017; 15:177-186. [PMID: 28529100 PMCID: PMC5487525 DOI: 10.1016/j.gpb.2016.12.005] [Citation(s) in RCA: 601] [Impact Index Per Article: 85.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/31/2016] [Revised: 12/08/2016] [Accepted: 12/25/2016] [Indexed: 02/08/2023]
Abstract
Advances in genomics technology over recent years have led to the surprising discovery that the genome is far more pervasively transcribed than was previously appreciated. Much of the newly-discovered transcriptome appears to represent long non-coding RNA (lncRNA), a heterogeneous group of largely uncharacterised transcripts. Understanding the biological function of these molecules represents a major challenge and in this review we discuss some of the progress made to date. One major theme of lncRNA biology seems to be the existence of a network of interactions with microRNA (miRNA) pathways. lncRNA has been shown to act as both a source and an inhibitory regulator of miRNA. At the transcriptional level, a model is emerging whereby lncRNA bridges DNA and protein by binding to chromatin and serving as a scaffold for modifying protein complexes. Such a mechanism can bridge promoters to enhancers or enhancer-like non-coding genes by regulating chromatin looping, as well as conferring specificity on histone modifying complexes by directing them to specific loci.
Collapse
|
29
|
Nishikawa K, Kinjo AR. Essential role of long non-coding RNAs in de novo chromatin modifications: the genomic address code hypothesis. Biophys Rev 2017; 9:73-77. [PMID: 28424740 PMCID: PMC5380698 DOI: 10.1007/s12551-017-0259-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2017] [Accepted: 03/05/2017] [Indexed: 12/19/2022] Open
Abstract
The epigenome, i.e., the whole of chromatin modifications, is transferred from mother to daughter cells during cell differentiation. When de novo chromatin modifications (establishment or erasure of, respectively, new or pre-existing DNA methylations and/or histone modifications) are made in a daughter cell, however, it has a different epigenome than its mother cell. Although de novo chromatin modification is an important event that comprises elementary processes of cell differentiation, its molecular mechanism remains poorly understood. We argue, in this letter, that a key to solving this problem lies in understanding the role of long non-coding RNAs (lncRNAs), a type of RNA that is becoming increasingly prominent in epigenetic studies. Many studies show that lncRNAs form ribonucleoprotein complexes in the nucleus and are involved in chromatin modifications. However, chromatin-modifying enzymes lack the information about genomic positions on which they act. It is known, on the other hand, that a single-stranded RNA in general can bind to a double-stranded DNA to form a triple helix. If each lncRNA forms a ribonucleoprotein complex with chromatin-modifying enzymes on one hand and, at the same time, a triple helix with a genomic region based on its specific nucleotide sequence on the other hand, it can induce de novo chromatin modifications at specific sites. Thus, the great variety of lncRNAs can be explained by the requirement for the diversity of "genomic address codes" specific to their cognate genomic regions where de novo chromatin modifications take place.
Collapse
Affiliation(s)
- Ken Nishikawa
- National Institute of Genetics, Research Organization of Information and Systems, Mishima, Shizuoka 411-8540 Japan
| | - Akira R. Kinjo
- Institute for Protein Research, Osaka University, Suita, Osaka 565-0871 Japan
| |
Collapse
|
30
|
Ramos MJN, Coito JL, Fino J, Cunha J, Silva H, de Almeida PG, Costa MMR, Amâncio S, Paulo OS, Rocheta M. Deep analysis of wild Vitis flower transcriptome reveals unexplored genome regions associated with sex specification. PLANT MOLECULAR BIOLOGY 2017; 93:151-170. [PMID: 27778293 DOI: 10.1007/s11103-016-0553-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2016] [Accepted: 10/15/2016] [Indexed: 06/06/2023]
Abstract
RNA-seq of Vitis during early stages of bud development, in male, female and hermaphrodite flowers, identified new loci outside of annotated gene models, suggesting their involvement in sex establishment. The molecular mechanisms responsible for flower sex specification remain unclear for most plant species. In the case of V. vinifera ssp. vinifera, it is not fully understood what determines hermaphroditism in the domesticated subspecies and male or female flowers in wild dioecious relatives (Vitis vinifera ssp. sylvestris). Here, we describe a de novo assembly of the transcriptome of three flower developmental stages from the three Vitis vinifera flower types. The validation of de novo assembly showed a correlation of 0.825. The main goals of this work were the identification of V. v. sylvestris exclusive transcripts and the characterization of differential gene expression during flower development. RNA from several flower developmental stages was used previously to generate Illumina sequence reads. Through a sequential de novo assembly strategy one comprehensive transcriptome comprising 95,516 non-redundant transcripts was assembled. From this dataset 81,064 transcripts were annotated to V. v. vinifera reference transcriptome and 11,084 were annotated against V. v. vinifera reference genome. Moreover, we found 3368 transcripts that could not be mapped to Vitis reference genome. From all the non-redundant transcripts that were assembled, bioinformatics analysis identified 133 specific of V. v. sylvestris and 516 transcripts differentially expressed among the three flower types. The detection of transcription from areas of the genome not currently annotated suggests active transcription of previously unannotated genomic loci during early stages of bud development.
Collapse
Affiliation(s)
- Miguel Jesus Nunes Ramos
- Universidade de Lisboa, Instituto Superior de Agronomia, LEAF, Linking Landscape, Environment, Agriculture and Food, Tapada da Ajuda, 1359-017, Lisboa, Portugal
| | - João Lucas Coito
- Universidade de Lisboa, Instituto Superior de Agronomia, LEAF, Linking Landscape, Environment, Agriculture and Food, Tapada da Ajuda, 1359-017, Lisboa, Portugal
| | - Joana Fino
- Computational Biology and Population Genomics Group, Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisboa, Portugal
| | - Jorge Cunha
- Instituto Nacional de Investigação Agrária e Veterinária, Quinta d'Almoinha, Dois Portos, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Oeiras, Portugal
| | - Helena Silva
- BioSystems & Integrative Sciences Institute (BioISI), Plant Functional Biology Centre, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
| | - Patrícia Gomes de Almeida
- Development and Evolutionary Morphogenesis, Centre for Ecology, Evolution and Environmental Change, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisboa, Portugal
| | - Maria Manuela Ribeiro Costa
- BioSystems & Integrative Sciences Institute (BioISI), Plant Functional Biology Centre, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
| | - Sara Amâncio
- Universidade de Lisboa, Instituto Superior de Agronomia, LEAF, Linking Landscape, Environment, Agriculture and Food, Tapada da Ajuda, 1359-017, Lisboa, Portugal
| | - Octávio S Paulo
- Computational Biology and Population Genomics Group, Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, 1749-016, Lisboa, Portugal
| | - Margarida Rocheta
- Universidade de Lisboa, Instituto Superior de Agronomia, LEAF, Linking Landscape, Environment, Agriculture and Food, Tapada da Ajuda, 1359-017, Lisboa, Portugal.
| |
Collapse
|
31
|
Abstract
Every ribonucleic acid begins its cellular life as a transcript. If the transcript or its processing product has a function it should be regarded an RNA. Nonfunctional transcripts, by-products from processing, degradation intermediates, even those originating from (functional) RNAs, and non-functional products of transcriptional gene regulation accomplished via the act of transcription, as well as stochastic (co)transcripts could simply be addressed as transcripts (class 0). The copious functional RNAs (class I), often maturing after one or more processing steps, already are systematized into ever expanding sub-classifications ranging from micro RNAs to rRNAs. Established sub-classifications addressing a wide functional diversity remain unaffected. mRNAs (class II) are distinct from any other RNA by virtue of their potential to be translated into (poly)peptide(s) on ribosomes. We are not proposing a novel RNA classification, but wish to add a basic concept with existing terminology (transcript, RNA, and mRNA) that should serve as an additional framework for carefully delineating RNA function from an avalanche of RNA sequencing data. At the same time, this top level hierarchical model should illuminate important principles of RNA evolution and biology thus heightening our awareness that in biology boundaries and categorizations are typically fuzzy.
Collapse
Affiliation(s)
- Jürgen Brosius
- a Institute of Experimental Pathology, ZMBE, University of Münster , Von-Esmarch-Str. 56, 48149 ; Münster , Germany.,b Institute of Evolutionary and Medical Genomics, Brandenburg Medical School (MHB) , Fehrbelliner Str. 38, 16816 ; Germany
| | - Carsten A Raabe
- a Institute of Experimental Pathology, ZMBE, University of Münster , Von-Esmarch-Str. 56, 48149 ; Münster , Germany.,b Institute of Evolutionary and Medical Genomics, Brandenburg Medical School (MHB) , Fehrbelliner Str. 38, 16816 ; Germany
| |
Collapse
|
32
|
Weikard R, Demasius W, Kuehn C. Mining long noncoding RNA in livestock. Anim Genet 2016; 48:3-18. [DOI: 10.1111/age.12493] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/16/2016] [Indexed: 02/01/2023]
Affiliation(s)
- R. Weikard
- Institute Genome Biology; Leibniz Institute for Farm Animal Biology (FBN); 18196 Dummerstorf Germany
| | - W. Demasius
- Institute Genome Biology; Leibniz Institute for Farm Animal Biology (FBN); 18196 Dummerstorf Germany
| | - C. Kuehn
- Institute Genome Biology; Leibniz Institute for Farm Animal Biology (FBN); 18196 Dummerstorf Germany
- Faculty of Agricultural and Environmental Sciences; University Rostock; 18059 Rostock Germany
| |
Collapse
|
33
|
Arvaniti E, Moulos P, Vakrakou A, Chatziantoniou C, Chadjichristos C, Kavvadas P, Charonis A, Politis PK. Whole-transcriptome analysis of UUO mouse model of renal fibrosis reveals new molecular players in kidney diseases. Sci Rep 2016; 6:26235. [PMID: 27189340 PMCID: PMC4870569 DOI: 10.1038/srep26235] [Citation(s) in RCA: 81] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 04/28/2016] [Indexed: 12/22/2022] Open
Abstract
Transcriptome analysis by RNA-seq technology allows novel insights into gene expression and regulatory networks in health and disease. To better understand the molecular basis of renal fibrosis, we performed RNA-seq analysis in the Unilateral Ureteric Obstruction (UUO) mouse model. We analysed sham operated, 2- and 8-day post-ligation renal tissues. Thousands of genes with statistical significant changes in their expression were identified and classified into cellular processes and molecular pathways. Many novel protein-coding genes were identified, including critical transcription factors with important regulatory roles in other tissues and diseases. Emphasis was placed on long non-coding RNAs (lncRNAs), a class of molecular regulators of multiple and diverse cellular functions. Selected lncRNA genes were further studied and their transcriptional activity was confirmed. For three of them, their transcripts were also examined in other mouse models of nephropathies and their up- or down-regulation was found similar to the UUO model. In vitro experiments confirmed that one selected lncRNA is independent of TGFβ or IL1b stimulation but can influence the expression of fibrosis-related proteins and the cellular phenotype. These data provide new information about the involvement of protein-coding and lncRNA genes in nephropathies, which can become novel diagnostic and therapeutic targets in the near future.
Collapse
Affiliation(s)
- Eleni Arvaniti
- Center for Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | | | - Athina Vakrakou
- Center for Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | | | | | | | - Aristidis Charonis
- Center for Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | - Panagiotis K. Politis
- Center for Basic Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| |
Collapse
|
34
|
Neme R, Tautz D. Fast turnover of genome transcription across evolutionary time exposes entire non-coding DNA to de novo gene emergence. eLife 2016; 5:e09977. [PMID: 26836309 PMCID: PMC4829534 DOI: 10.7554/elife.09977] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2015] [Accepted: 02/01/2016] [Indexed: 01/17/2023] Open
Abstract
Deep sequencing analyses have shown that a large fraction of genomes is transcribed, but the significance of this transcription is much debated. Here, we characterize the phylogenetic turnover of poly-adenylated transcripts in a comprehensive sampling of taxa of the mouse (genus Mus), spanning a phylogenetic distance of 10 Myr. Using deep RNA sequencing we find that at a given sequencing depth transcriptome coverage becomes saturated within a taxon, but keeps extending when compared between taxa, even at this very shallow phylogenetic level. Our data show a high turnover of transcriptional states between taxa and that no major transcript-free islands exist across evolutionary time. This suggests that the entire genome can be transcribed into poly-adenylated RNA when viewed at an evolutionary time scale. We conclude that any part of the non-coding genome can potentially become subject to evolutionary functionalization via de novo gene evolution within relatively short evolutionary time spans. DOI:http://dx.doi.org/10.7554/eLife.09977.001 Traditionally, the genome – the sum total of DNA within a cell – was thought to be divided into genes and ‘non-coding’ regions. Genes are copied, or “transcribed”, into molecules called RNA that perform essential tasks in the cell. The roles of the non-coding regions were often less clear, although it has since become apparent that some are also transcribed and generate low levels of RNA molecules. However, many debate how significant this transcription is to living organisms. Neme and Tautz have now used a technique called deep RNA sequencing to study the RNA molecules produced in several different species and types of mice whose last common ancestor lived 10 million years ago. Different species produced RNA molecules from different portions – both genes and non-coding regions – of their genomes. Comparing these RNA sequences suggests that changes to the regions that are transcribed occur relatively quickly for a large portion of the genome. Furthermore, there have been no significant areas of the common ancestor’s genome that have not been transcribed at some point in at least one of its descendent species. This therefore suggests that over a relatively short evolutionary period, any part of the genome can acquire the ability to be transcribed and potentially form a new gene. The next challenge is to find out how often these transcribed non-coding parts of the genome show important biochemical activities, and how they find their way into becoming new genes. DOI:http://dx.doi.org/10.7554/eLife.09977.002
Collapse
Affiliation(s)
- Rafik Neme
- Max-Planck Institute for Evolutionary Biology, Plön, Germany
| | - Diethard Tautz
- Max-Planck Institute for Evolutionary Biology, Plön, Germany
| |
Collapse
|
35
|
Ansseau E, Eidahl JO, Lancelot C, Tassin A, Matteotti C, Yip C, Liu J, Leroy B, Hubeau C, Gerbaux C, Cloet S, Wauters A, Zorbo S, Meyer P, Pirson I, Laoudj-Chenivesse D, Wattiez R, Harper SQ, Belayew A, Coppée F. Homologous Transcription Factors DUX4 and DUX4c Associate with Cytoplasmic Proteins during Muscle Differentiation. PLoS One 2016; 11:e0146893. [PMID: 26816005 PMCID: PMC4729438 DOI: 10.1371/journal.pone.0146893] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Accepted: 12/24/2015] [Indexed: 12/26/2022] Open
Abstract
Hundreds of double homeobox (DUX) genes map within 3.3-kb repeated elements dispersed in the human genome and encode DNA-binding proteins. Among these, we identified DUX4, a potent transcription factor that causes facioscapulohumeral muscular dystrophy (FSHD). In the present study, we performed yeast two-hybrid screens and protein co-purifications with HaloTag-DUX fusions or GST-DUX4 pull-down to identify protein partners of DUX4, DUX4c (which is identical to DUX4 except for the end of the carboxyl terminal domain) and DUX1 (which is limited to the double homeodomain). Unexpectedly, we identified and validated (by co-immunoprecipitation, GST pull-down, co-immunofluorescence and in situ Proximal Ligation Assay) the interaction of DUX4, DUX4c and DUX1 with type III intermediate filament protein desmin in the cytoplasm and at the nuclear periphery. Desmin filaments link adjacent sarcomere at the Z-discs, connect them to sarcolemma proteins and interact with mitochondria. These intermediate filament also contact the nuclear lamina and contribute to positioning of the nuclei. Another Z-disc protein, LMCD1 that contains a LIM domain was also validated as a DUX4 partner. The functionality of DUX4 or DUX4c interactions with cytoplasmic proteins is underscored by the cytoplasmic detection of DUX4/DUX4c upon myoblast fusion. In addition, we identified and validated (by co-immunoprecipitation, co-immunofluorescence and in situ Proximal Ligation Assay) as DUX4/4c partners several RNA-binding proteins such as C1QBP, SRSF9, RBM3, FUS/TLS and SFPQ that are involved in mRNA splicing and translation. FUS and SFPQ are nuclear proteins, however their cytoplasmic translocation was reported in neuronal cells where they associated with ribonucleoparticles (RNPs). Several other validated or identified DUX4/DUX4c partners are also contained in mRNP granules, and the co-localizations with cytoplasmic DAPI-positive spots is in keeping with such an association. Large muscle RNPs were recently shown to exit the nucleus via a novel mechanism of nuclear envelope budding. Following DUX4 or DUX4c overexpression in muscle cell cultures, we observed their association with similar nuclear buds. In conclusion, our study demonstrated unexpected interactions of DUX4/4c with cytoplasmic proteins playing major roles during muscle differentiation. Further investigations are on-going to evaluate whether these interactions play roles during muscle regeneration as previously suggested for DUX4c.
Collapse
Affiliation(s)
- Eugénie Ansseau
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Jocelyn O. Eidahl
- Center for Gene Therapy, Research Institute at Nationwide Children's Hospital, Columbus, OH, United States of America
| | - Céline Lancelot
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Alexandra Tassin
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Christel Matteotti
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Cassandre Yip
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Jian Liu
- Center for Gene Therapy, Research Institute at Nationwide Children's Hospital, Columbus, OH, United States of America
| | - Baptiste Leroy
- Laboratory of Proteomic and Microbiology, Research Institute for Biosciences, University of Mons, Mons, Belgium
| | - Céline Hubeau
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Cécile Gerbaux
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Samuel Cloet
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Armelle Wauters
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Sabrina Zorbo
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Pierre Meyer
- Pediatric Department, CHRU Montpellier, Montpellier, France
| | - Isabelle Pirson
- I.R.I.B.H.M., Free University of Brussels, Brussels, Belgium
| | | | - Ruddy Wattiez
- Laboratory of Proteomic and Microbiology, Research Institute for Biosciences, University of Mons, Mons, Belgium
| | - Scott Q. Harper
- Center for Gene Therapy, Research Institute at Nationwide Children's Hospital, Columbus, OH, United States of America
- Department of Pediatrics, Ohio State University College of Medicine, Columbus, OH, United States of America
| | - Alexandra Belayew
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
| | - Frédérique Coppée
- Laboratory of Molecular Biology, Research Institute for Health Sciences and Technology, University of Mons, Mons, Belgium
- * E-mail:
| |
Collapse
|
36
|
Holl HM, Gao S, Fei Z, Andrews C, Brooks SA. Generation of a de novo transcriptome from equine lamellar tissue. BMC Genomics 2015; 16:739. [PMID: 26432030 PMCID: PMC4592545 DOI: 10.1186/s12864-015-1948-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Accepted: 09/22/2015] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Laminitis, the structural failure of interdigitated tissue that suspends the distal skeleton within the hoof capsule, is a devastating disease that is the second leading cause of both lameness and euthanasia in the horse. Current transcriptomic research focuses on the expression of known genes. However, as this tissue is quite unique and equine gene annotation is largely derived from computational predictions, there are likely yet uncharacterized transcripts that may be involved in the etiology of laminitis. In order to create a novel annotation resource, we performed whole transcriptome sequencing of sagittal lamellar sections from one control and two laminitis affected horses. RESULTS Whole transcriptome sequencing of the three samples resulted in 113 million reads. Overall, 88 % of the reads mapped to the equCab2 reference genome, allowing for the identification of 119,430 SNPs. The de novo assembly generated around 75,000 transcripts, of which 36,000 corresponded to known annotations. Annotated transcript models are hosted in a public data repository and thus can be easily accessed or loaded into genome browsers. RT-PCR of 12 selected assemblies confirmed structure and expression in lamellar tissue. CONCLUSIONS Transcriptome sequencing represents a powerful tool to expand on equine annotation and identify novel targets for further laminitis research.
Collapse
Affiliation(s)
- Heather M Holl
- Department of Animal Sciences, University of Florida, Gainesville, FL, 32611, USA.
| | - Shan Gao
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, 14853, USA.
| | - Zhangjun Fei
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, 14853, USA.
| | - Caroline Andrews
- Laboratory of Molecular Immunoregulation, National Cancer Institute, Bethesda, MD, 20892, USA.
| | - Samantha A Brooks
- Department of Animal Sciences, University of Florida, Gainesville, FL, 32611, USA.
| |
Collapse
|
37
|
Raabe CA, Brosius J. Does every transcript originate from a gene? Ann N Y Acad Sci 2015; 1341:136-48. [PMID: 25847549 DOI: 10.1111/nyas.12741] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Revised: 02/05/2015] [Accepted: 02/11/2015] [Indexed: 12/20/2022]
Abstract
Outdated gene definitions favored regions corresponding to mature messenger RNAs, in particular, the open reading frame. In eukaryotes, the intergenic space was widely regarded nonfunctional and devoid of RNA transcription. Original concepts were based on the assumption that RNA expression was restricted to known protein-coding genes and a few so-called structural RNA genes, such as ribosomal RNAs or transfer RNAs. With the discovery of introns and, more recently, sensitive techniques for monitoring genome-wide transcription, this view had to be substantially modified. Tiling microarrays and RNA deep sequencing revealed myriads of transcripts, which cover almost entire genomes. The tremendous complexity of non-protein-coding RNA transcription has to be integrated into novel gene definitions. Despite an ever-growing list of functional RNAs, questions concerning the mass of identified transcripts are under dispute. Here, we examined genome-wide transcription from various angles, including evolutionary considerations, and suggest, in analogy to novel alternative splice variants that do not persist, that the vast majority of transcripts represent raw material for potential, albeit rare, exaptation events.
Collapse
Affiliation(s)
- Carsten A Raabe
- Institute of Experimental Pathology, ZMBE, University of Münster, Münster, Germany
| | | |
Collapse
|
38
|
Long MD, Sucheston-Campbell LE, Campbell MJ. Vitamin D receptor and RXR in the post-genomic era. J Cell Physiol 2015; 230:758-66. [PMID: 25335912 DOI: 10.1002/jcp.24847] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 10/16/2014] [Indexed: 12/25/2022]
Abstract
Following the elucidation of the human genome and components of the epigenome, it is timely to revisit what is known of vitamin D receptor (VDR) function. Early transcriptomic studies using microarray approaches focused on the protein coding mRNA that were regulated by the VDR, usually following treatment with ligand. These studies quickly established the approximate size and surprising diversity of the VDR transcriptome, revealing it to be highly heterogenous and cell type and time dependent. Investigators also considered VDR regulation of non-protein coding RNA and again, cell and time dependency was observed. Attempts to integrate mRNA and miRNA regulation patterns are beginning to reveal patterns of co-regulation and interaction that allow for greater control of mRNA expression, and the capacity to govern more complex cellular events. Alternative splicing in the trasncriptome has emerged as a critical process in transcriptional control and there is evidence of the VDR interacting with components of the splicesome. ChIP-Seq approaches have proved to be pivotal to reveal the diversity of the VDR binding choices across cell types and following treatment, and have revealed that the majority of these are non-canonical in nature. The underlying causes driving the diversity of VDR binding choices remain enigmatic. Finally, genetic variation has emerged as important to impact the transcription factor affinity towards genomic binding sites, and recently the impact of this on VDR function has begun to be considered.
Collapse
Affiliation(s)
- Mark D Long
- Department of Pharmacology and Therapeutics, Roswell Park Cancer Institute, Buffalo, New York
| | | | | |
Collapse
|
39
|
St Laurent G, Wahlestedt C, Kapranov P. The Landscape of long noncoding RNA classification. Trends Genet 2015; 31:239-51. [PMID: 25869999 DOI: 10.1016/j.tig.2015.03.007] [Citation(s) in RCA: 836] [Impact Index Per Article: 92.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Revised: 03/09/2015] [Accepted: 03/12/2015] [Indexed: 12/12/2022]
Abstract
Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long noncoding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the noncoding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual unambiguous classification framework results in a number of challenges in the annotation and interpretation of noncoding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel the function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then, we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets.
Collapse
Affiliation(s)
- Georges St Laurent
- St. Laurent Institute, 317 New Boston St., Suite 201, Woburn, MA 01801 USA; Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, 185 Meeting Street, Providence, RI 02912, USA
| | - Claes Wahlestedt
- Center for Therapeutic Innovation and Department of Psychiatry and Behavioral Sciences, University of Miami Miller School of Medicine, 1501 NW 10th Ave, Miami, FL 33136 USA.
| | - Philipp Kapranov
- Institute of Genomics, School of Biomedical Sciences, Huaqiao Univerisity, 668 Jimei Road, Xiamen, China 361021; St. Laurent Institute, 317 New Boston St., Suite 201, Woburn, MA 01801 USA.
| |
Collapse
|
40
|
Chen ZX, Sturgill D, Qu J, Jiang H, Park S, Boley N, Suzuki AM, Fletcher AR, Plachetzki DC, FitzGerald PC, Artieri CG, Atallah J, Barmina O, Brown JB, Blankenburg KP, Clough E, Dasgupta A, Gubbala S, Han Y, Jayaseelan JC, Kalra D, Kim YA, Kovar CL, Lee SL, Li M, Malley JD, Malone JH, Mathew T, Mattiuzzo NR, Munidasa M, Muzny DM, Ongeri F, Perales L, Przytycka TM, Pu LL, Robinson G, Thornton RL, Saada N, Scherer SE, Smith HE, Vinson C, Warner CB, Worley KC, Wu YQ, Zou X, Cherbas P, Kellis M, Eisen MB, Piano F, Kionte K, Fitch DH, Sternberg PW, Cutter AD, Duff MO, Hoskins RA, Graveley BR, Gibbs RA, Bickel PJ, Kopp A, Carninci P, Celniker SE, Oliver B, Richards S. Comparative validation of the D. melanogaster modENCODE transcriptome annotation. Genome Res 2015; 24:1209-23. [PMID: 24985915 PMCID: PMC4079975 DOI: 10.1101/gr.159384.113] [Citation(s) in RCA: 111] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Accurate gene model annotation of reference genomes is critical for making them useful. The modENCODE project has improved the D. melanogaster genome annotation by using deep and diverse high-throughput data. Since transcriptional activity that has been evolutionarily conserved is likely to have an advantageous function, we have performed large-scale interspecific comparisons to increase confidence in predicted annotations. To support comparative genomics, we filled in divergence gaps in the Drosophila phylogeny by generating draft genomes for eight new species. For comparative transcriptome analysis, we generated mRNA expression profiles on 81 samples from multiple tissues and developmental stages of 15 Drosophila species, and we performed cap analysis of gene expression in D. melanogaster and D. pseudoobscura. We also describe conservation of four distinct core promoter structures composed of combinations of elements at three positions. Overall, each type of genomic feature shows a characteristic divergence rate relative to neutral models, highlighting the value of multispecies alignment in annotating a target genome that should prove useful in the annotation of other high priority genomes, especially human and other mammalian genomes that are rich in noncoding sequences. We report that the vast majority of elements in the annotation are evolutionarily conserved, indicating that the annotation will be an important springboard for functional genetic testing by the Drosophila community.
Collapse
Affiliation(s)
- Zhen-Xia Chen
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - David Sturgill
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Jiaxin Qu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Huaiyang Jiang
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Soo Park
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Nathan Boley
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Ana Maria Suzuki
- Technology Development Group, RIKEN Omics Science Center and RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama City, Kanagawa, Japan 230-0045
| | - Anthony R Fletcher
- Division of Computational Bioscience, Center For Information Technology, National Institutes of Health, Bethesda, Maryland 20814, USA
| | - David C Plachetzki
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Peter C FitzGerald
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Carlo G Artieri
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Joel Atallah
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Olga Barmina
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - James B Brown
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Kerstin P Blankenburg
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Emily Clough
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Abhijit Dasgupta
- Clinical Trials and Outcomes Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Sai Gubbala
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Yi Han
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Joy C Jayaseelan
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Divya Kalra
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Yoo-Ah Kim
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Christie L Kovar
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Sandra L Lee
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Mingmei Li
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - James D Malley
- Division of Computational Bioscience, Center For Information Technology, National Institutes of Health, Bethesda, Maryland 20814, USA
| | - John H Malone
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Tittu Mathew
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Nicolas R Mattiuzzo
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Mala Munidasa
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Donna M Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Fiona Ongeri
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Lora Perales
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Teresa M Przytycka
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Ling-Ling Pu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Garrett Robinson
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Rebecca L Thornton
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Nehad Saada
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Steven E Scherer
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Harold E Smith
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Charles Vinson
- National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Crystal B Warner
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Kim C Worley
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Yuan-Qing Wu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Xiaoyan Zou
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Peter Cherbas
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 20139, USA
| | - Michael B Eisen
- Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Fabio Piano
- Department of Biology, New York University, New York, New York 10003, USA
| | - Karin Kionte
- Department of Biology, New York University, New York, New York 10003, USA
| | - David H Fitch
- Department of Biology, New York University, New York, New York 10003, USA
| | - Paul W Sternberg
- HHMI and Division of Biology, California Institute of Technology, Pasadena, California 91125, USA
| | - Asher D Cutter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B2, Canada
| | - Michael O Duff
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030-6403, USA
| | - Roger A Hoskins
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Brenton R Graveley
- Department of Genetics and Developmental Biology, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, Connecticut 06030-6403, USA
| | - Richard A Gibbs
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Peter J Bickel
- Department of Statistics, University of California, Berkeley, California 94720, USA
| | - Artyom Kopp
- Department of Evolution and Ecology, University of California, Davis, California 95616, USA
| | - Piero Carninci
- Technology Development Group, RIKEN Omics Science Center and RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama City, Kanagawa, Japan 230-0045
| | - Susan E Celniker
- Department of Genome Dynamics, Life Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Brian Oliver
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Stephen Richards
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
41
|
Where to begin? Mapping transcription start sites genome-wide in Escherichia coli. J Bacteriol 2014; 197:4-6. [PMID: 25331438 DOI: 10.1128/jb.02410-14] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Recent genome-wide studies of bacterial transcription have revealed large numbers of promoters located inside genes. In this issue of the Journal of Bacteriology, Thomason and colleagues (J. Bacteriol. 197:18-28, 2015, doi:10.1128/JB.02096-14) map transcription start sites in Escherichia coli on an unprecedented scale. This work provides important insights into the regulation of transcripts that initiate inside genes and sources of variability between studies aimed at identifying these RNAs.
Collapse
|
42
|
Aldrich JC, Maggert KA. Simple quantitative PCR approach to reveal naturally occurring and mutation-induced repetitive sequence variation on the Drosophila Y chromosome. PLoS One 2014; 9:e109906. [PMID: 25285439 PMCID: PMC4186871 DOI: 10.1371/journal.pone.0109906] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Accepted: 09/13/2014] [Indexed: 02/06/2023] Open
Abstract
Heterochromatin is a significant component of the human genome and the genomes of most model organisms. Although heterochromatin is thought to be largely non-coding, it is clear that it plays an important role in chromosome structure and gene regulation. Despite a growing awareness of its functional significance, the repetitive sequences underlying some heterochromatin remain relatively uncharacterized. We have developed a real-time quantitative PCR-based method for quantifying simple repetitive satellite sequences and have used this technique to characterize the heterochromatic Y chromosome of Drosophila melanogaster. In this report, we validate the approach, identify previously unknown satellite sequence copy number polymorphisms in Y chromosomes from different geographic sources, and show that a defect in heterochromatin formation can induce similar copy number polymorphisms in a laboratory strain. These findings provide a simple method to investigate the dynamic nature of repetitive sequences and characterize conditions which might give rise to long-lasting alterations in DNA sequence.
Collapse
Affiliation(s)
- John C. Aldrich
- Department of Biology, Texas A&M University, College Station, Texas, United States of America
| | - Keith A. Maggert
- Department of Biology, Texas A&M University, College Station, Texas, United States of America
| |
Collapse
|
43
|
Ulveling D, Dinger ME, Francastel C, Hubé F. Identification of a dinucleotide signature that discriminates coding from non-coding long RNAs. Front Genet 2014; 5:316. [PMID: 25250049 PMCID: PMC4158813 DOI: 10.3389/fgene.2014.00316] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Accepted: 08/22/2014] [Indexed: 11/13/2022] Open
Abstract
To date, the main criterion by which long ncRNAs (lncRNAs) are discriminated from mRNAs is based on the capacity of the transcripts to encode a protein. However, it becomes important to identify non-ORF-based sequence characteristics that can be used to parse between ncRNAs and mRNAs. In this study, we first established an extremely selective workflow to define a highly refined database of lncRNAs which was used for comparison with mRNAs. Then using this highly selective collection of lncRNAs, we found the CG dinucleotide frequencies were clearly distinct. In addition, we showed that the bias in CG dinucleotide frequency was conserved in human and mouse genomes. We propose that this sequence feature will serve as a useful classifier in transcript classification pipelines. We also suggest that our refined database of "bona fide" lncRNAs will be valuable for the discovery of other sequence characteristics distinct to lncRNAs.
Collapse
Affiliation(s)
- Damien Ulveling
- CNRS UMR7216, Epigenetics and Cell Fate, Université Paris Diderot, Sorbonne Paris Cité Paris, France
| | - Marcel E Dinger
- The University of Queensland Diamantina Institute, The University of Queensland Brisbane, QLD, Australia
| | - Claire Francastel
- CNRS UMR7216, Epigenetics and Cell Fate, Université Paris Diderot, Sorbonne Paris Cité Paris, France
| | - Florent Hubé
- CNRS UMR7216, Epigenetics and Cell Fate, Université Paris Diderot, Sorbonne Paris Cité Paris, France
| |
Collapse
|
44
|
The noncoding RNA revolution. Int J Biochem Cell Biol 2014; 54:287. [PMID: 25091701 DOI: 10.1016/j.biocel.2014.07.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
45
|
Brosius J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb Perspect Biol 2014; 6:a016089. [PMID: 25081515 DOI: 10.1101/cshperspect.a016089] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery.
Collapse
Affiliation(s)
- Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
46
|
Baboo S, Cook PR. "Dark matter" worlds of unstable RNA and protein. Nucleus 2014; 5:281-6. [PMID: 25482115 PMCID: PMC4152340 DOI: 10.4161/nucl.29577] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2014] [Revised: 06/10/2014] [Accepted: 06/13/2014] [Indexed: 12/11/2022] Open
Abstract
Astrophysicists use the term "dark matter" to describe the majority of the matter and/or energy in the universe that is hidden from view, and biologists now apply it to the new families of RNA they are uncovering. We review evidence for an analogous hidden world containing peptides. The critical experiments involved pulse-labeling human cells with tagged amino acids for periods as short as five seconds. Results are extraordinary in two respects: both nucleus and cytoplasm become labeled, and most signals disappear with a half-life of less than one minute. Just as the synthesis of each mature mRNA is regulated by the abortive production of hundreds of shorter transcripts that are quickly degraded, it seems that the synthesis of each full-length protein in the stable proteome is regulated by an apparently wasteful production and degradation of shorter peptides. Some of the nuclear synthesis is probably a byproduct of nuclear ribosomes proofreading newly-made RNA for inappropriately-placed termination codons (a process that triggers "nonsense-mediated decay"). We speculate that some "dark-matter" peptides will play other important roles in the cell.
Collapse
Affiliation(s)
- Sabyasachi Baboo
- Sir William Dunn School of Pathology; University of Oxford; Oxford, UK
| | - Peter R Cook
- Sir William Dunn School of Pathology; University of Oxford; Oxford, UK
| |
Collapse
|
47
|
Lund SH, Gudbjartsson DF, Rafnar T, Sigurdsson A, Gudjonsson SA, Gudmundsson J, Stefansson K, Stefansson G. A method for detecting long non-coding RNAs with tiled RNA expression microarrays. PLoS One 2014; 9:e99899. [PMID: 24937006 PMCID: PMC4061049 DOI: 10.1371/journal.pone.0099899] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2014] [Accepted: 05/20/2014] [Indexed: 01/08/2023] Open
Abstract
Long non-coding ribonucleic acids (lncRNAs) have been proposed as biomarkers in prostate cancer. This paper proposes a selection method which uses data from tiled microarrays to identify relatively long regions of moderate expression independent of the microarray platform and probe design. The method is used to search for candidate long non-coding ribonucleic acids (lncRNAs) at locus 8q24 and is run on three independent experiments which all use samples from prostate cancer patients. The robustness of the method is tested by utilizing repeated copies of tiled probes. The method shows high consistency between experiments that used the same samples, but different probe layout. There also is statistically significant consistency when comparing experiments with different samples. The method selected the long non-coding ribonucleic acid PCNCR1 in all three experiments.
Collapse
Affiliation(s)
- Sigrun Helga Lund
- Faculty of Physical Sciences, University of Iceland, Reykjavik, Iceland
- * E-mail:
| | | | | | | | | | | | | | - Gunnar Stefansson
- Faculty of Physical Sciences, University of Iceland, Reykjavik, Iceland
| |
Collapse
|
48
|
St Laurent G, Vyatkin Y, Kapranov P. Dark matter RNA illuminates the puzzle of genome-wide association studies. BMC Med 2014; 12:97. [PMID: 24924000 PMCID: PMC4054906 DOI: 10.1186/1741-7015-12-97] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/07/2014] [Accepted: 05/22/2014] [Indexed: 12/12/2022] Open
Abstract
In the past decade, numerous studies have made connections between sequence variants in human genomes and predisposition to complex diseases. However, most of these variants lie outside of the charted regions of the human genome whose function we understand; that is, the sequences that encode proteins. Consequently, the general concept of a mechanism that translates these variants into predisposition to diseases has been lacking, potentially calling into question the validity of these studies. Here we make a connection between the growing class of apparently functional RNAs that do not encode proteins and whose function we do not yet understand (the so-called 'dark matter' RNAs) and the disease-associated variants. We review advances made in a different genomic mapping effort - unbiased profiling of all RNA transcribed from the human genome - and provide arguments that the disease-associated variants exert their effects via perturbation of regulatory properties of non-coding RNAs existing in mammalian cells.
Collapse
Affiliation(s)
| | | | - Philipp Kapranov
- St, Laurent Institute, 317 New Boston St, Suite 201, Woburn, MA 01801, USA.
| |
Collapse
|
49
|
Baboo S, Bhushan B, Jiang H, Grovenor CRM, Pierre P, Davis BG, Cook PR. Most human proteins made in both nucleus and cytoplasm turn over within minutes. PLoS One 2014; 9:e99346. [PMID: 24911415 PMCID: PMC4050049 DOI: 10.1371/journal.pone.0099346] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Accepted: 05/13/2014] [Indexed: 12/25/2022] Open
Abstract
In bacteria, protein synthesis can be coupled to transcription, but in eukaryotes it is believed to occur solely in the cytoplasm. Using pulses as short as 5 s, we find that three analogues – L-azidohomoalanine, puromycin (detected after attaching fluors using ‘click’ chemistry or immuno-labeling), and amino acids tagged with ‘heavy’ 15N and 13C (detected using secondary ion mass spectrometry) – are incorporated into the nucleus and cytoplasm in a process sensitive to translational inhibitors. The nuclear incorporation represents a significant fraction of the total, and labels in both compartments have half-lives of less than a minute; results are consistent with most newly-made peptides being destroyed soon after they are made. As nascent RNA bearing a premature termination codon (detected by fluorescence in situ hybridization) is also eliminated by a mechanism sensitive to a translational inhibitor, the nuclear turnover of peptides is probably a by-product of proof-reading the RNA for stop codons (a process known as nonsense-mediated decay). We speculate that the apparently-wasteful turnover of this previously-hidden (‘dark-matter’) world of peptide is involved in regulating protein production.
Collapse
Affiliation(s)
- Sabyasachi Baboo
- Sir William Dunn School of Pathology, University of Oxford, Oxford, United Kingdom
| | - Bhaskar Bhushan
- Chemistry Research Laboratory, Department of Chemistry, University of Oxford, Oxford, United Kingdom
| | - Haibo Jiang
- Department of Materials, University of Oxford, Oxford, United Kingdom
| | | | - Philippe Pierre
- Centre d′Immunologie de Marseille-Luminy, Aix-Marseille Université, Marseille, France
- Institut National de la Santé et de la Recherche Médicale, U1104, Marseille, France
- Centre National de la Recherche Scientifique, Unités Mixtes de Recherche 7280, Marseille, France
| | - Benjamin G. Davis
- Chemistry Research Laboratory, Department of Chemistry, University of Oxford, Oxford, United Kingdom
| | - Peter R. Cook
- Sir William Dunn School of Pathology, University of Oxford, Oxford, United Kingdom
- * E-mail:
| |
Collapse
|
50
|
Campbell MJ. Vitamin D and the RNA transcriptome: more than mRNA regulation. Front Physiol 2014; 5:181. [PMID: 24860511 PMCID: PMC4030167 DOI: 10.3389/fphys.2014.00181] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2014] [Accepted: 04/21/2014] [Indexed: 12/23/2022] Open
Abstract
The GRCh37.p13 primary assembly of the human genome contains 20805 protein coding mRNA, and 37147 non-protein coding genes and pseudogenes that as a result of RNA processing and editing generate 196501 gene transcripts. Given the size and diversity of the human transcriptome, it is timely to revisit what is known of VDR function in the regulation and targeting of transcription. Early transcriptomic studies using microarray approaches focused on the protein coding mRNA that were regulated by the VDR, usually following treatment with ligand. These studies quickly established the approximate size, and surprising diversity of the VDR transcriptome, revealing it to be highly heterogenous and cell type and time dependent. With the discovery of microRNA, investigators also considered VDR regulation of these non-protein coding RNA. Again, cell and time dependency has emerged. Attempts to integrate mRNA and miRNA regulation patterns are beginning to reveal patterns of co-regulation and interaction that allow for greater control of mRNA expression, and the capacity to govern more complex cellular events. As the awareness of the diversity of non-coding RNA increases, it is increasingly likely it will be revealed that VDR actions are mediated through these molecules also. Key knowledge gaps remain over the VDR transcriptome. The causes for the cell and type dependent transcriptional heterogenetiy remain enigmatic. ChIP-Seq approaches have confirmed that VDR binding choices differ very significantly by cell type, but as yet the underlying causes distilling VDR binding choices are unclear. Similarly, it is clear that many of the VDR binding sites are non-canonical in nature but again the mechanisms underlying these interactions are unclear. Finally, although alternative splicing is clearly a very significant process in cellular transcriptional control, the lack of RNA-Seq data centered on VDR function are currently limiting the global assessment of the VDR transcriptome. VDR focused research that complements publically available data (e.g., ENCODE Birney et al., 2007; Birney, 2012), TCGA (Strausberg et al., 2002), GTEx (Consortium, 2013) will enable these questions to be addressed through large-scale data integration efforts.
Collapse
Affiliation(s)
- Moray J Campbell
- Department of Pharmacology and Therapeutics, Roswell Park Cancer Institute Buffalo, NY, USA
| |
Collapse
|