1
|
Barai P, Biswas S, Verma P, Duncan EM. RNaseH-based ribodepletion of total planarian RNA improves detection of longer and non-polyadenylated transcripts. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.20.604429. [PMID: 39071286 PMCID: PMC11275719 DOI: 10.1101/2024.07.20.604429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
The overwhelming majority of RNA species isolated from cells or tissues using organic extraction are ribosomal RNAs (rRNA), whereas a relatively small percentage are messenger RNAs (mRNA). For studies that seek to detect mRNA transcripts and measure changes in their expression, this lopsided ratio of desired transcripts to undesired transcripts creates a significant challenge to obtaining sensitive and reproducible results. One method for improving mRNA detection is to selectively amplify polyadenylated (polyA) mRNA molecules when generating RNA-seq libraries, a strategy that is generally very successful in many species. However, this strategy is less effective when starting with total RNA from some species e.g., the planarian species Schmidtea mediterranea (S.med), as it generates libraries that still contain significant and variable amounts of rRNA reads. Further, commercially available ribodepletion kits do not efficiently deplete rRNAs from these samples because their sequences are divergent from mammalian rRNAs. Here we report a customized, optimized, and economical ribodepletion strategy than allows the generation of comprehensive RNA-seq libraries with less than one percent rRNA contamination. We show that this method improves transcript detection, particularly for those without polyA tails (e.g., core histones) and those that are relatively long (e.g., microtubule motor proteins). Using this custom ribodepletion approach, we also detected many transcripts that are not represented in the most recent set of S.med gene annotations, including a subset that are likely expressed transposable elements (TEs). To facilitate future differential expression analyses of these newly identified loci, we created both an annotation file of the new loci we identified and a bioinformatic pipeline for generating additional annotations from future libraries. As significant recent research shows that TE activation is regulated and functionally important, the resources provided here will provide a starting point for investigating such mechanisms in planarians and other species with less conserved rRNA sequences.
Collapse
Affiliation(s)
- Pallob Barai
- Department of Biology, University of Kentucky, Lexington KY 40506
| | - Shishir Biswas
- Department of Biology, University of Kentucky, Lexington KY 40506
| | - Prince Verma
- Department of Biology, University of Kentucky, Lexington KY 40506
| | | |
Collapse
|
2
|
Wan CY, Davis J, Chauhan M, Gleeson J, Prawer YJ, De Paoli-Iseppi R, Wells C, Choi J, Clark M. IsoVis - a webserver for visualization and annotation of alternative RNA isoforms. Nucleic Acids Res 2024; 52:W341-W347. [PMID: 38709877 PMCID: PMC11223830 DOI: 10.1093/nar/gkae343] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 03/28/2024] [Accepted: 04/18/2024] [Indexed: 05/08/2024] Open
Abstract
Genes commonly express multiple RNA products (RNA isoforms), which differ in exonic content and can have different functions. Making sense of the plethora of known and novel RNA isoforms being identified by transcriptomic approaches requires a user-friendly way to visualize gene isoforms and how they differ in exonic content, expression levels and potential functions. Here we introduce IsoVis, a freely available webserver that accepts user-supplied transcriptomic data and visualizes the expressed isoforms in a clear, intuitive manner. IsoVis contains numerous features, including the ability to visualize all RNA isoforms of a gene and their expression levels; the annotation of known isoforms from external databases; mapping of protein domains and features to exons, allowing changes to protein sequence and function between isoforms to be established; and extensive species compatibility. Datasets visualised on IsoVis remain private to the user, allowing analysis of sensitive data. IsoVis visualisations can be downloaded to create publication-ready figures. The IsoVis webserver enables researchers to perform isoform analyses without requiring programming skills, is free to use, and available at https://isomix.org/isovis/.
Collapse
Affiliation(s)
- Ching Yin Wan
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Jack Davis
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Manveer Chauhan
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Josie Gleeson
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Yair D J Prawer
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Ricardo De Paoli-Iseppi
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Christine A Wells
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Jarny Choi
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| | - Michael B Clark
- Department of Anatomy and Physiology, The University of Melbourne, Parkville, Victoria, 3010, Australia
| |
Collapse
|
3
|
Liau WS, Zhao Q, Bademosi A, Gormal RS, Gong H, Marshall PR, Periyakaruppiah A, Madugalle SU, Zajaczkowski EL, Leighton LJ, Ren H, Musgrove M, Davies J, Rauch S, He C, Dickinson BC, Li X, Wei W, Meunier FA, Fernández-Moya SM, Kiebler MA, Srinivasan B, Banerjee S, Clark M, Spitale RC, Bredy TW. Fear extinction is regulated by the activity of long noncoding RNAs at the synapse. Nat Commun 2023; 14:7616. [PMID: 37993455 PMCID: PMC10665438 DOI: 10.1038/s41467-023-43535-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 11/12/2023] [Indexed: 11/24/2023] Open
Abstract
Long noncoding RNAs (lncRNAs) represent a multidimensional class of regulatory molecules that are involved in many aspects of brain function. Emerging evidence indicates that lncRNAs are localized to the synapse; however, a direct role for their activity in this subcellular compartment in memory formation has yet to be demonstrated. Using lncRNA capture-seq, we identified a specific set of lncRNAs that accumulate in the synaptic compartment within the infralimbic prefrontal cortex of adult male C57/Bl6 mice. Among these was a splice variant related to the stress-associated lncRNA, Gas5. RNA immunoprecipitation followed by mass spectrometry and single-molecule imaging revealed that this Gas5 isoform, in association with the RNA binding proteins G3BP2 and CAPRIN1, regulates the activity-dependent trafficking and clustering of RNA granules. In addition, we found that cell-type-specific, activity-dependent, and synapse-specific knockdown of the Gas5 variant led to impaired fear extinction memory. These findings identify a new mechanism of fear extinction that involves the dynamic interaction between local lncRNA activity and RNA condensates in the synaptic compartment.
Collapse
Affiliation(s)
- Wei-Siang Liau
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia.
| | - Qiongyi Zhao
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Adekunle Bademosi
- Single Molecule Neuroscience Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Rachel S Gormal
- Single Molecule Neuroscience Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Hao Gong
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Paul R Marshall
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Ambika Periyakaruppiah
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Sachithrani U Madugalle
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Esmi L Zajaczkowski
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Laura J Leighton
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Haobin Ren
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Mason Musgrove
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Joshua Davies
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Simone Rauch
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
| | - Chuan He
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
| | - Bryan C Dickinson
- Department of Chemistry, The University of Chicago, Chicago, IL, USA
| | - Xiang Li
- Department of Neurosurgery, Zhongnan Hospital of Wuhan University, Wuhan, China
- Medical Research Institute, Wuhan University, Wuhan, China
| | - Wei Wei
- Department of Neurosurgery, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Frédéric A Meunier
- Single Molecule Neuroscience Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia
- School of Biomedical Sciences, The University of Queensland, Brisbane, QLD, Australia
| | - Sandra M Fernández-Moya
- Biomedical Centre, Ludwig Maximilian University of Munich, Munich, Germany
- Gene Regulation of Cell Identity, Regenerative Medicine Program, Bellvitge Institute for Biomedical Research (IDIBELL) and Program for Advancing Clinical Translation of Regenerative Medicine of Catalonia, P-CMR[C], L'Hospitalet del Llobregat, 08908, Barcelona, Spain
| | - Michael A Kiebler
- Biomedical Centre, Ludwig Maximilian University of Munich, Munich, Germany
| | | | | | - Michael Clark
- Department of Anatomy and Physiology, University of Melbourne, Parkville, VIC, Australia
| | - Robert C Spitale
- Department of Pharmaceutical Sciences, The University of California, Irvine, CA, USA
| | - Timothy W Bredy
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
4
|
Wang Z, Zhou K, Yuan Q, Chen D, Hu X, Xie F, Liu Y, Xing J. A High-Efficiency Capture-Based NGS Approach for Comprehensive Analysis of Mitochondrial Transcriptome. Anal Chem 2023; 95:17046-17053. [PMID: 37937716 DOI: 10.1021/acs.analchem.3c03741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2023]
Abstract
The transcription of the mitochondrial genome is pivotal for maintenance of mitochondrial functions, and the deregulated mitochondrial transcriptome contributes to various pathological changes. Despite substantial progress having been achieved in uncovering the transcriptional complexity of the nuclear transcriptome, many unknowns and controversies remain for the mitochondrial transcriptome, partially owing to the lack of a highly efficient mitochondrial RNA (mtRNA) sequencing and analysis approach. Here, we first comprehensively evaluated the influence of essential experimental protocols, including strand-specific library construction, two RNA enrichment strategies, and optimal rRNA depletion, on accurately profiling mitochondrial transcriptome in whole-transcriptome sequencing (WTS) data. Based on these insights, we developed a highly efficient approach specifically suitable for targeted sequencing of whole mitochondrial transcriptome, termed capture-based mtRNA seq (CAP), in which strand-specific library construction and optimal rRNA depletion were applied. Compared with WTS, CAP has a great decrease of required data volume without affecting the sensitivity and accuracy of detection. In addition, CAP also characterized the unannotated mt-tRNA transcripts whose expression levels are below the detection limits of conventional WTS. As a proof-of-concept characterization of mtRNAs, the transcription initiation sites and mtRNA cleavage ratio were accurately identified in CAP data. Moreover, CAP had very reliable performance in plasma and single-cell samples, highlighting its wide application. Altogether, the present study has established a highly efficient pipeline for targeted sequencing of mtRNAs, which may pave the way toward functional annotation of mtRNAs and mtRNA-based diagnostic and therapeutic strategies in various diseases.
Collapse
Affiliation(s)
- Zhenni Wang
- State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and Department of Physiology and Pathophysiology, Fourth Military Medical University, Xi'an710032, China
| | - Kaixiang Zhou
- State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and Department of Physiology and Pathophysiology, Fourth Military Medical University, Xi'an710032, China
| | - Qing Yuan
- Institute of Medical Research, Northwestern Polytechnical University, Xi'an710072, China
| | - Dongbo Chen
- Institute of Medical Research, Northwestern Polytechnical University, Xi'an710072, China
| | - Xi'e Hu
- Department of General Surgery, Tangdu Hospital, Fourth Military Medical University, Xi'an710038, China
| | - Fanfan Xie
- State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and Department of Physiology and Pathophysiology, Fourth Military Medical University, Xi'an710032, China
| | - Yang Liu
- Department of Clinical Diagnosis, Tangdu Hospital, Fourth Military Medical University, Xi'an 710038, China
| | - Jinliang Xing
- State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and Department of Physiology and Pathophysiology, Fourth Military Medical University, Xi'an710032, China
| |
Collapse
|
5
|
Rietdijk S, Keszei M, Castro W, Terhorst C, Abadía-Molina AC. Characterization of Ly108-H1 Signaling Reveals Ly108-3 Expression and Additional Strain-Specific Differences in Lupus Prone Mice. Int J Mol Sci 2023; 24:5024. [PMID: 36902453 PMCID: PMC10003074 DOI: 10.3390/ijms24055024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 02/10/2023] [Accepted: 02/28/2023] [Indexed: 03/08/2023] Open
Abstract
Ly108 (SLAMF6) is a homophilic cell surface molecule that binds SLAM-associated protein (SAP), an intracellular adapter protein that modulates humoral immune responses. Furthermore, Ly108 is crucial for the development of natural killer T (NKT) cells and CTL cytotoxicity. Significant attention has been paid towards expression and function of Ly108 since multiple isoforms were identified, i.e., Ly108-1, Ly108-2, Ly108-3, and Ly108-H1, some of which are differentially expressed in several mouse strains. Surprisingly, Ly108-H1 appeared to protect against disease in a congenic mouse model of Lupus. Here, we use cell lines to further define Ly108-H1 function in comparison with other isoforms. We show that Ly108-H1 inhibits IL-2 production while having little effect upon cell death. With a refined method, we could detect phosphorylation of Ly108-H1 and show that SAP binding is retained. We propose that Ly108-H1 may regulate signaling at two levels by retaining the capability to bind its extracellular as well as intracellular ligands, possibly inhibiting downstream pathways. In addition, we detected Ly108-3 in primary cells and show that this isoform is also differentially expressed between mouse strains. The presence of additional binding motifs and a non-synonymous SNP in Ly108-3 further extends the diversity between murine strains. This work highlights the importance of isoform awareness, as inherent homology can present a challenge when interpreting mRNA and protein expression data, especially as alternatively splicing potentially affects function.
Collapse
Affiliation(s)
- Svend Rietdijk
- Unidad de Inmunología, IBIMER, CIBM, Universidad de Granada, 18016 Granada, Spain
- Division of Immunology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
- Department of Gastroenterology and Hepatology, OLVG Hospital, 1091 AC Amsterdam, The Netherlands
| | - Marton Keszei
- Division of Immunology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
| | - Wilson Castro
- Division of Immunology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
| | - Cox Terhorst
- Division of Immunology, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA 02215, USA
| | - Ana C. Abadía-Molina
- Unidad de Inmunología, IBIMER, CIBM, Universidad de Granada, 18016 Granada, Spain
- Departamento de Bioquímica y Biología Molecular III e Inmunología, Facultad de Medicina, Universidad de Granada, 18016 Granada, Spain
| |
Collapse
|
6
|
Wei W, Zhao Q, Wang Z, Liau WS, Basic D, Ren H, Marshall PR, Zajaczkowski EL, Leighton LJ, Madugalle SU, Musgrove M, Periyakaruppiah A, Shi J, Zhang J, Mattick JS, Mercer TR, Spitale RC, Li X, Bredy TW. ADRAM is an experience-dependent long noncoding RNA that drives fear extinction through a direct interaction with the chaperone protein 14-3-3. Cell Rep 2022; 38:110546. [PMID: 35320727 PMCID: PMC9015815 DOI: 10.1016/j.celrep.2022.110546] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 02/03/2022] [Accepted: 02/28/2022] [Indexed: 11/25/2022] Open
Abstract
Here, we used RNA capture-seq to identify a large population of lncRNAs that are expressed in the infralimbic prefrontal cortex of adult male mice in response to fear-related learning. Combining these data with cell-type-specific ATAC-seq on neurons that had been selectively activated by fear extinction learning, we find inducible 434 lncRNAs that are derived from enhancer regions in the vicinity of protein-coding genes. In particular, we discover an experience-induced lncRNA we call ADRAM (activity-dependent lncRNA associated with memory) that acts as both a scaffold and a combinatorial guide to recruit the brain-enriched chaperone protein 14-3-3 to the promoter of the memory-associated immediate-early gene Nr4a2 and is required fear extinction memory. This study expands the lexicon of experience-dependent lncRNA activity in the brain and highlights enhancer-derived RNAs (eRNAs) as key players in the epigenomic regulation of gene expression associated with the formation of fear extinction memory.
Collapse
Affiliation(s)
- Wei Wei
- Department of Neurosurgery, Zhongnan Hospital of Wuhan University, Wuhan, China; Brain Research Center, Zhongnan Hospital of Wuhan University, Wuhan, China; Medical Research Institute, Wuhan University, Wuhan, China.
| | - Qiongyi Zhao
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Ziqi Wang
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Wei-Siang Liau
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Dean Basic
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Haobin Ren
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Paul R Marshall
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Esmi L Zajaczkowski
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Laura J Leighton
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Sachithrani U Madugalle
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Mason Musgrove
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Ambika Periyakaruppiah
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia
| | - Jichun Shi
- Department of Neurosurgery, Zhongnan Hospital of Wuhan University, Wuhan, China; Brain Research Center, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - Jianjian Zhang
- Department of Neurosurgery, Zhongnan Hospital of Wuhan University, Wuhan, China
| | - John S Mattick
- School of Biotechnology and Biomolecular Sciences, The University of New South Wales, Sydney, Australia
| | - Timothy R Mercer
- Australian Institute of Bioengineering and Nanotechnology, The University of Queensland, Brisbane, Australia
| | - Robert C Spitale
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, USA
| | - Xiang Li
- Department of Neurosurgery, Zhongnan Hospital of Wuhan University, Wuhan, China; Brain Research Center, Zhongnan Hospital of Wuhan University, Wuhan, China; Medical Research Institute, Wuhan University, Wuhan, China
| | - Timothy W Bredy
- Cognitive Neuroepigenetics Laboratory, Queensland Brain Institute, The University of Queensland, Brisbane, Australia.
| |
Collapse
|
7
|
Chung M, Bruno VM, Rasko DA, Cuomo CA, Muñoz JF, Livny J, Shetty AC, Mahurkar A, Dunning Hotopp JC. Best practices on the differential expression analysis of multi-species RNA-seq. Genome Biol 2021; 22:121. [PMID: 33926528 PMCID: PMC8082843 DOI: 10.1186/s13059-021-02337-8] [Citation(s) in RCA: 50] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2020] [Accepted: 04/01/2021] [Indexed: 02/07/2023] Open
Abstract
Advances in transcriptome sequencing allow for simultaneous interrogation of differentially expressed genes from multiple species originating from a single RNA sample, termed dual or multi-species transcriptomics. Compared to single-species differential expression analysis, the design of multi-species differential expression experiments must account for the relative abundances of each organism of interest within the sample, often requiring enrichment methods and yielding differences in total read counts across samples. The analysis of multi-species transcriptomics datasets requires modifications to the alignment, quantification, and downstream analysis steps compared to the single-species analysis pipelines. We describe best practices for multi-species transcriptomics and differential gene expression.
Collapse
Affiliation(s)
- Matthew Chung
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Vincent M. Bruno
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - David A. Rasko
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Christina A. Cuomo
- Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA 02142 USA
| | - José F. Muñoz
- Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA 02142 USA
| | - Jonathan Livny
- Infectious Disease and Microbiome Program, Broad Institute, Cambridge, MA 02142 USA
| | - Amol C. Shetty
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Anup Mahurkar
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
| | - Julie C. Dunning Hotopp
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201 USA
- Greenebaum Cancer Center, University of Maryland, Baltimore, MD 21201 USA
| |
Collapse
|
8
|
Abstract
Metazoan genomes produce thousands of long-noncoding RNAs (lncRNAs), of which just a small fraction have been well characterized. Understanding their biological functions requires accurate annotations, or maps of the precise location and structure of genes and transcripts in the genome. Current lncRNA annotations are limited by compromises between quality and size, with many gene models being fragmentary or uncatalogued. To overcome this, the GENCODE consortium has developed RNA capture long-read sequencing (CLS), an approach combining targeted RNA capture with third-generation long-read sequencing. CLS provides accurate annotations at high-throughput rates. It eliminates the need for noisy transcriptome assembly from short reads, and requires minimal manual curation. The full-length transcript models produced are of quality comparable to present-day manually curated annotations. Here we describe a detailed CLS protocol, from probe design through long-read sequencing to creation of final annotations.
Collapse
|
9
|
Curion F, Handel AE, Attar M, Gallone G, Bowden R, Cader MZ, Clark MB. Targeted RNA sequencing enhances gene expression profiling of ultra-low input samples. RNA Biol 2020; 17:1741-1753. [PMID: 32597303 PMCID: PMC7746246 DOI: 10.1080/15476286.2020.1777768] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 03/16/2020] [Accepted: 04/20/2020] [Indexed: 12/22/2022] Open
Abstract
RNA-seq is the standard method for profiling gene expression in many biological systems. Due to the wide dynamic range and complex nature of the transcriptome, RNA-seq provides an incomplete characterization, especially of lowly expressed genes and transcripts. Targeted RNA sequencing (RNA CaptureSeq) focuses sequencing on genes of interest, providing exquisite sensitivity for transcript detection and quantification. However, uses of CaptureSeq have focused on bulk samples and its performance on very small populations of cells is unknown. Here we show CaptureSeq greatly enhances transcriptomic profiling of target genes in ultra-low-input samples and provides equivalent performance to that on bulk samples. We validate the performance of CaptureSeq using multiple probe sets on samples of iPSC-derived cortical neurons. We demonstrate up to 275-fold enrichment for target genes, the detection of 10% additional genes and a greater than 5-fold increase in identified gene isoforms. Analysis of spike-in controls demonstrated CaptureSeq improved both detection sensitivity and expression quantification. Comparison to the CORTECON database of cerebral cortex development revealed CaptureSeq enhanced the identification of sample differentiation stage. CaptureSeq provides sensitive, reliable and quantitative expression measurements on hundreds-to-thousands of target genes from ultra-low-input samples and has the potential to greatly enhance transcriptomic profiling when samples are limiting.
Collapse
Affiliation(s)
- Fabiola Curion
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Adam E Handel
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
- Translational Molecular Neuroscience Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Moustafa Attar
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Kennedy Institute of Rheumatology, University of Oxford, Oxford, UK
| | - Giuseppe Gallone
- Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford, UK
| | - Rory Bowden
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - M. Zameel Cader
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
- Translational Molecular Neuroscience Group, Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
| | - Michael B Clark
- Department of Psychiatry, University of Oxford, Oxford, UK
- Centre for Stem Cell Systems, Department of Anatomy and Neuroscience, The University of Melbourne, Parkville, Australia
| |
Collapse
|
10
|
Ray TA, Cochran K, Kozlowski C, Wang J, Alexander G, Cady MA, Spencer WJ, Ruzycki PA, Clark BS, Laeremans A, He MX, Wang X, Park E, Hao Y, Iannaccone A, Hu G, Fedrigo O, Skiba NP, Arshavsky VY, Kay JN. Comprehensive identification of mRNA isoforms reveals the diversity of neural cell-surface molecules with roles in retinal development and disease. Nat Commun 2020; 11:3328. [PMID: 32620864 PMCID: PMC7335077 DOI: 10.1038/s41467-020-17009-7] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 05/30/2020] [Indexed: 02/08/2023] Open
Abstract
Genes encoding cell-surface proteins control nervous system development and are implicated in neurological disorders. These genes produce alternative mRNA isoforms which remain poorly characterized, impeding understanding of how disease-associated mutations cause pathology. Here we introduce a strategy to define complete portfolios of full-length isoforms encoded by individual genes. Applying this approach to neural cell-surface molecules, we identify thousands of unannotated isoforms expressed in retina and brain. By mass spectrometry we confirm expression of newly-discovered proteins on the cell surface in vivo. Remarkably, we discover that the major isoform of a retinal degeneration gene, CRB1, was previously overlooked. This CRB1 isoform is the only one expressed by photoreceptors, the affected cells in CRB1 disease. Using mouse mutants, we identify a function for this isoform at photoreceptor-glial junctions and demonstrate that loss of this isoform accelerates photoreceptor death. Therefore, our isoform identification strategy enables discovery of new gene functions relevant to disease.
Collapse
Affiliation(s)
- Thomas A Ray
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Kelly Cochran
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Chris Kozlowski
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Jingjing Wang
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Graham Alexander
- Center for Genomic and Computational Biology, Duke University, Durham, NC, 27710, USA
| | | | | | - Philip A Ruzycki
- John F. Hardesty, M.D. Department of Ophthalmology and Visual Sciences, Washington University, St. Louis, MO, 63110, USA
| | - Brian S Clark
- John F. Hardesty, M.D. Department of Ophthalmology and Visual Sciences, Washington University, St. Louis, MO, 63110, USA
- Department of Developmental Biology, Washington University, St. Louis, MO, 63110, USA
| | | | - Ming-Xiao He
- Advanced Cell Diagnostics, Newark, CA, 94560, USA
| | | | - Emily Park
- Advanced Cell Diagnostics, Newark, CA, 94560, USA
| | - Ying Hao
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Alessandro Iannaccone
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Gary Hu
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, 27710, USA
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Olivier Fedrigo
- Center for Genomic and Computational Biology, Duke University, Durham, NC, 27710, USA
- The Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
| | - Nikolai P Skiba
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Vadim Y Arshavsky
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA
| | - Jeremy N Kay
- Department of Neurobiology, Duke University School of Medicine, Durham, NC, 27710, USA.
- Department of Ophthalmology, Duke University School of Medicine, Durham, NC, 27710, USA.
| |
Collapse
|
11
|
Wu W, Ji X, Zhao Y. Emerging Roles of Long Non-coding RNAs in Chronic Neuropathic Pain. Front Neurosci 2019; 13:1097. [PMID: 31680832 PMCID: PMC6813851 DOI: 10.3389/fnins.2019.01097] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Accepted: 09/30/2019] [Indexed: 02/06/2023] Open
Abstract
Chronic neuropathic pain, a type of chronic and potentially disabling pain caused by a disease or injury of the somatosensory nervous system, spinal cord injury, or various chronic conditions, such as viral infections (e.g., post-herpetic neuralgia), autoimmune diseases, cancers, and metabolic disorders (e.g., diabetes mellitus), is one of the most intense types of chronic pain, which incurs a major socio-economic burden and is a serious public health issue, with an estimated prevalence of 7–10% in adults throughout the world. Presently, the available drug treatments (e.g., anticonvulsants acting at calcium channels, serotonin-noradrenaline reuptake inhibitors, tricyclic antidepressants, opioids, topical lidocaine, etc.) for chronic neuropathic pain patients are still rare and have disappointing efficacy, which makes it difficult to relieve the patients’ painful symptoms, and, at best, they only try to reduce the patients’ ability to tolerate pain. Long non-coding RNAs (lncRNAs), a type of transcript of more than 200 nucleotides with no protein-coding or limited capacity, were identified to be abnormally expressed in the spinal cord, dorsal root ganglion, hippocampus, and prefrontal cortex under chronic neuropathic pain conditions. Moreover, a rapidly growing body of data has clearly pointed out that nearly 40% of lncRNAs exist specifically in the nervous system. Hence, it was speculated that these dysregulated lncRNAs might participate in the occurrence, development, and progression of chronic neuropathic pain. In other words, if we deeply delve into the potential roles of lncRNAs in the pathogenesis of chronic neuropathic pain, this may open up new strategies and directions for the development of novel targeted drugs to cure this refractory disorder. In this article, we primarily review the status of chronic neuropathic pain and provide a general overview of lncRNAs, the detailed roles of lncRNAs in the nervous system and its related diseases, and the abnormal expression of lncRNAs and their potential clinical applications in chronic neuropathic pain. We hope that through the above description, readers can gain a better understanding of the emerging roles of lncRNAs in chronic neuropathic pain.
Collapse
Affiliation(s)
- Wei Wu
- College of Food Science and Engineering, Qingdao Agricultural University, Qingdao, China
| | - Xiaojun Ji
- Department of Neurology, Affiliated Hospital of Qingdao University, Qingdao, China
| | - Yang Zhao
- Department of Anesthesiology, Affiliated Hospital to Qingdao University, Qingdao, China
| |
Collapse
|
12
|
Hardwick SA, Bassett SD, Kaczorowski D, Blackburn J, Barton K, Bartonicek N, Carswell SL, Tilgner HU, Loy C, Halliday G, Mercer TR, Smith MA, Mattick JS. Targeted, High-Resolution RNA Sequencing of Non-coding Genomic Regions Associated With Neuropsychiatric Functions. Front Genet 2019; 10:309. [PMID: 31031799 PMCID: PMC6473190 DOI: 10.3389/fgene.2019.00309] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2019] [Accepted: 03/21/2019] [Indexed: 12/18/2022] Open
Abstract
The human brain is one of the last frontiers of biomedical research. Genome-wide association studies (GWAS) have succeeded in identifying thousands of haplotype blocks associated with a range of neuropsychiatric traits, including disorders such as schizophrenia, Alzheimer's and Parkinson's disease. However, the majority of single nucleotide polymorphisms (SNPs) that mark these haplotype blocks fall within non-coding regions of the genome, hindering their functional validation. While some of these GWAS loci may contain cis-acting regulatory DNA elements such as enhancers, we hypothesized that many are also transcribed into non-coding RNAs that are missing from publicly available transcriptome annotations. Here, we use targeted RNA capture ('RNA CaptureSeq') in combination with nanopore long-read cDNA sequencing to transcriptionally profile 1,023 haplotype blocks across the genome containing non-coding GWAS SNPs associated with neuropsychiatric traits, using post-mortem human brain tissue from three neurologically healthy donors. We find that the majority (62%) of targeted haplotype blocks, including 13% of intergenic blocks, are transcribed into novel, multi-exonic RNAs, most of which are not yet recorded in GENCODE annotations. We validated our findings with short-read RNA-seq, providing orthogonal confirmation of novel splice junctions and enabling a quantitative assessment of the long-read assemblies. Many novel transcripts are supported by independent evidence of transcription including cap analysis of gene expression (CAGE) data and epigenetic marks, and some show signs of potential functional roles. We present these transcriptomes as a preliminary atlas of non-coding transcription in human brain that can be used to connect neurological phenotypes with gene expression.
Collapse
Affiliation(s)
- Simon A. Hardwick
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
- Brain and Mind Research Institute and Center for Neurogenetics, Weill Cornell Medicine, New York, NY, United States
| | - Samuel D. Bassett
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
| | - Dominik Kaczorowski
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - James Blackburn
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
| | - Kirston Barton
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Nenad Bartonicek
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
| | - Shaun L. Carswell
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Hagen U. Tilgner
- Brain and Mind Research Institute and Center for Neurogenetics, Weill Cornell Medicine, New York, NY, United States
| | - Clement Loy
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Brain and Mind Centre, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW, Australia
| | - Glenda Halliday
- Brain and Mind Centre, Faculty of Medicine and Health, The University of Sydney, Camperdown, NSW, Australia
| | - Tim R. Mercer
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
- Altius Institute for Biomedical Sciences, Seattle, WA, United States
| | - Martin A. Smith
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
| | - John S. Mattick
- Genomics and Epigenetics Division, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
- Faculty of Medicine, University of New South Wales Sydney, Kensington, NSW, Australia
- Green Templeton College, Oxford, United Kingdom
| |
Collapse
|
13
|
Turner AW, Wong D, Khan MD, Dreisbach CN, Palmore M, Miller CL. Multi-Omics Approaches to Study Long Non-coding RNA Function in Atherosclerosis. Front Cardiovasc Med 2019; 6:9. [PMID: 30838214 PMCID: PMC6389617 DOI: 10.3389/fcvm.2019.00009] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2018] [Accepted: 01/30/2019] [Indexed: 12/15/2022] Open
Abstract
Atherosclerosis is a complex inflammatory disease of the vessel wall involving the interplay of multiple cell types including vascular smooth muscle cells, endothelial cells, and macrophages. Large-scale genome-wide association studies (GWAS) and the advancement of next generation sequencing technologies have rapidly expanded the number of long non-coding RNA (lncRNA) transcripts predicted to play critical roles in the pathogenesis of the disease. In this review, we highlight several lncRNAs whose functional role in atherosclerosis is well-documented through traditional biochemical approaches as well as those identified through RNA-sequencing and other high-throughput assays. We describe novel genomics approaches to study both evolutionarily conserved and divergent lncRNA functions and interactions with DNA, RNA, and proteins. We also highlight assays to resolve the complex spatial and temporal regulation of lncRNAs. Finally, we summarize the latest suite of computational tools designed to improve genomic and functional annotation of these transcripts in the human genome. Deep characterization of lncRNAs is fundamental to unravel coronary atherosclerosis and other cardiovascular diseases, as these regulatory molecules represent a new class of potential therapeutic targets and/or diagnostic markers to mitigate both genetic and environmental risk factors.
Collapse
Affiliation(s)
- Adam W. Turner
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States
| | - Doris Wong
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, United States
| | - Mohammad Daud Khan
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States
| | - Caitlin N. Dreisbach
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States
- School of Nursing, University of Virginia, Charlottesville, VA, United States
- Data Science Institute, University of Virginia, Charlottesville, VA, United States
| | - Meredith Palmore
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States
| | - Clint L. Miller
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, United States
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, United States
- Data Science Institute, University of Virginia, Charlottesville, VA, United States
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, United States
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, United States
| |
Collapse
|
14
|
Uszczynska-Ratajczak B, Lagarde J, Frankish A, Guigó R, Johnson R. Towards a complete map of the human long non-coding RNA transcriptome. Nat Rev Genet 2018; 19:535-548. [PMID: 29795125 PMCID: PMC6451964 DOI: 10.1038/s41576-018-0017-y] [Citation(s) in RCA: 408] [Impact Index Per Article: 58.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Gene maps, or annotations, enable us to navigate the functional landscape of our genome. They are a resource upon which virtually all studies depend, from single-gene to genome-wide scales and from basic molecular biology to medical genetics. Yet present-day annotations suffer from trade-offs between quality and size, with serious but often unappreciated consequences for downstream studies. This is particularly true for long non-coding RNAs (lncRNAs), which are poorly characterized compared to protein-coding genes. Long-read sequencing technologies promise to improve current annotations, paving the way towards a complete annotation of lncRNAs expressed throughout a human lifetime.
Collapse
Affiliation(s)
| | - Julien Lagarde
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Catalonia, Spain
| | - Adam Frankish
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Roderic Guigó
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Catalonia, Spain
| | - Rory Johnson
- Department of Medical Oncology, Inselspital, University Hospital and University of Bern, Bern, Switzerland.
- Department of Biomedical Research (DBMR), University of Bern, Bern, Switzerland.
| |
Collapse
|
15
|
Abstract
Identification of important, functional small RNA (sRNA) species is currently hampered by the lack of reliable and sensitive methods to isolate and characterize them. We have developed a method, termed target-enrichment of sRNAs (TEsR), that enables targeted sequencing of rare sRNAs and diverse precursor and mature forms of sRNAs not detectable by current standard sRNA sequencing methods. It is based on the amplification of full-length sRNA molecules, production of biotinylated RNA probes, hybridization to one or multiple targeted RNAs, removal of nontargeted sRNAs and sequencing. By this approach, target sRNAs can be enriched by a factor of 500-30,000 while maintaining strand specificity. TEsR enriches for sRNAs irrespective of length or different molecular features, such as the presence or absence of a 5' cap or of secondary structures or abundance levels. Moreover, TEsR allows the detection of the complete sequence (including sequence variants, and 5' and 3' ends) of precursors, as well as intermediate and mature forms, in a quantitative manner. A well-trained molecular biologist can complete the TEsR procedure, from RNA extraction to sequencing library preparation, within 4-6 d.
Collapse
|
16
|
Abstract
It is estimated that more than 90% of the mammalian genome is transcribed as non-coding RNAs. Recent evidences have established that these non-coding transcripts are not junk or just transcriptional noise, but they do serve important biological purpose. One of the rapidly expanding fields of this class of transcripts is the regulatory lncRNAs, which had been a major challenge in terms of their molecular functions and mechanisms of action. The emergence of high-throughput technologies and the development in various conventional approaches have led to the expansion of the lncRNA world. The combination of multidisciplinary approaches has proven to be essential to unravel the complexity of their regulatory networks and helped establish the importance of their existence. Here, we review the current methodologies available for discovering and investigating functions of long non-coding RNAs (lncRNAs) and focus on the powerful technological advancement available to specifically address their functional importance.
Collapse
|
17
|
Clark BS, Blackshaw S. Understanding the Role of lncRNAs in Nervous System Development. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1008:253-282. [PMID: 28815543 DOI: 10.1007/978-981-10-5203-3_9] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The diversity of lncRNAs has expanded within mammals in tandem with the evolution of increased brain complexity, suggesting that lncRNAs play an integral role in this process. In this chapter, we will highlight the identification and characterization of lncRNAs in nervous system development. We discuss the potential role of lncRNAs in nervous system and brain evolution, along with efforts to create comprehensive catalogues that analyze spatial and temporal changes in lncRNA expression during nervous system development. Additionally, we focus on recent endeavors that attempt to assign function to lncRNAs during nervous system development. We highlight discrepancies that have been observed between in vitro and in vivo studies of lncRNA function and the challenges facing researchers in conducting mechanistic analyses of lncRNAs in the developing nervous system. Altogether, this chapter highlights the emerging role of lncRNAs in the developing brain and sheds light on novel, RNA-mediated mechanisms by which nervous system development is controlled.
Collapse
Affiliation(s)
- Brian S Clark
- Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Seth Blackshaw
- Johns Hopkins University School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
18
|
Lagarde J, Uszczynska-Ratajczak B, Carbonell S, Pérez-Lluch S, Abad A, Davis C, Gingeras TR, Frankish A, Harrow J, Guigo R, Johnson R. High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing. Nat Genet 2017; 49:1731-1740. [PMID: 29106417 PMCID: PMC5709232 DOI: 10.1038/ng.3988] [Citation(s) in RCA: 182] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2017] [Accepted: 10/11/2017] [Indexed: 12/20/2022]
Abstract
Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete-many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.
Collapse
Affiliation(s)
- Julien Lagarde
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Barbara Uszczynska-Ratajczak
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Silvia Carbonell
- R&D Department, Quantitative Genomic Medicine Laboratories (qGenomics), Barcelona, Spain
| | - Sílvia Pérez-Lluch
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Amaya Abad
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Carrie Davis
- Functional Genomics Group, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Thomas R. Gingeras
- Functional Genomics Group, Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, New York 11724, USA
| | - Adam Frankish
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK CB10 1HH
| | - Jennifer Harrow
- Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, UK CB10 1HH
| | - Roderic Guigo
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Rory Johnson
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| |
Collapse
|
19
|
Kietrys AM, Velema WA, Kool ET. Fingerprints of Modified RNA Bases from Deep Sequencing Profiles. J Am Chem Soc 2017; 139:17074-17081. [PMID: 29111692 PMCID: PMC5819333 DOI: 10.1021/jacs.7b07914] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Posttranscriptional modifications of RNA bases are not only found in many noncoding RNAs but have also recently been identified in coding (messenger) RNAs as well. They require complex and laborious methods to locate, and many still lack methods for localized detection. Here we test the ability of next-generation sequencing (NGS) to detect and distinguish between ten modified bases in synthetic RNAs. We compare ultradeep sequencing patterns of modified bases, including miscoding, insertions and deletions (indels), and truncations, to unmodified bases in the same contexts. The data show widely varied responses to modification, ranging from no response, to high levels of mutations, insertions, deletions, and truncations. The patterns are distinct for several of the modifications, and suggest the future use of ultradeep sequencing as a fingerprinting strategy for locating and identifying modifications in cellular RNAs.
Collapse
Affiliation(s)
- Anna M. Kietrys
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Willem A. Velema
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Eric T. Kool
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
20
|
When Long Noncoding RNAs Meet Genome Editing in Pluripotent Stem Cells. Stem Cells Int 2017; 2017:3250624. [PMID: 29333164 PMCID: PMC5733163 DOI: 10.1155/2017/3250624] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Accepted: 10/25/2017] [Indexed: 11/18/2022] Open
Abstract
Most of the human genome can be transcribed into RNAs, but only a minority of these regions produce protein-coding mRNAs whereas the remaining regions are transcribed into noncoding RNAs. Long noncoding RNAs (lncRNAs) were known for their influential regulatory roles in multiple biological processes such as imprinting, dosage compensation, transcriptional regulation, and splicing. The physiological functions of protein-coding genes have been extensively characterized through genome editing in pluripotent stem cells (PSCs) in the past 30 years; however, the study of lncRNAs with genome editing technologies only came into attentions in recent years. Here, we summarize recent advancements in dissecting the roles of lncRNAs with genome editing technologies in PSCs and highlight potential genome editing tools useful for examining the functions of lncRNAs in PSCs.
Collapse
|
21
|
Steward CA, Parker APJ, Minassian BA, Sisodiya SM, Frankish A, Harrow J. Genome annotation for clinical genomic diagnostics: strengths and weaknesses. Genome Med 2017; 9:49. [PMID: 28558813 PMCID: PMC5448149 DOI: 10.1186/s13073-017-0441-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
The Human Genome Project and advances in DNA sequencing technologies have revolutionized the identification of genetic disorders through the use of clinical exome sequencing. However, in a considerable number of patients, the genetic basis remains unclear. As clinicians begin to consider whole-genome sequencing, an understanding of the processes and tools involved and the factors to consider in the annotation of the structure and function of genomic elements that might influence variant identification is crucial. Here, we discuss and illustrate the strengths and weaknesses of approaches for the annotation and classification of important elements of protein-coding genes, other genomic elements such as pseudogenes and the non-coding genome, comparative-genomic approaches for inferring gene function, and new technologies for aiding genome annotation, as a practical guide for clinicians when considering pathogenic sequence variation. Complete and accurate annotation of structure and function of genome features has the potential to reduce both false-negative (from missing annotation) and false-positive (from incorrect annotation) errors in causal variant identification in exome and genome sequences. Re-analysis of unsolved cases will be necessary as newer technology improves genome annotation, potentially improving the rate of diagnosis.
Collapse
Affiliation(s)
- Charles A Steward
- Congenica Ltd, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1DR, UK. .,The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | | | - Berge A Minassian
- Department of Pediatrics (Neurology), University of Texas Southwestern, Dallas, TX, USA.,Program in Genetics and Genome Biology and Department of Paediatrics (Neurology), The Hospital for Sick Children and University of Toronto, Toronto, Canada
| | - Sanjay M Sisodiya
- Department of Clinical and Experimental Epilepsy, UCL Institute of Neurology, London, WC1N 3BG, UK.,Chalfont Centre for Epilepsy, Chesham Lane, Chalfont St Peter, Buckinghamshire, SL9 0RJ, UK
| | - Adam Frankish
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Jennifer Harrow
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.,Illumina Inc, Great Chesterford, Essex, CB10 1XL, UK
| |
Collapse
|
22
|
Cheng CY, Krishnakumar V, Chan AP, Thibaud-Nissen F, Schobel S, Town CD. Araport11: a complete reannotation of the Arabidopsis thaliana reference genome. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 89:789-804. [PMID: 27862469 DOI: 10.1111/tpj.13415] [Citation(s) in RCA: 664] [Impact Index Per Article: 83.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Revised: 10/27/2016] [Accepted: 11/03/2016] [Indexed: 05/20/2023]
Abstract
The flowering plant Arabidopsis thaliana is a dicot model organism for research in many aspects of plant biology. A comprehensive annotation of its genome paves the way for understanding the functions and activities of all types of transcripts, including mRNA, the various classes of non-coding RNA, and small RNA. The TAIR10 annotation update had a profound impact on Arabidopsis research but was released more than 5 years ago. Maintaining the accuracy of the annotation continues to be a prerequisite for future progress. Using an integrative annotation pipeline, we assembled tissue-specific RNA-Seq libraries from 113 datasets and constructed 48 359 transcript models of protein-coding genes in eleven tissues. In addition, we annotated various classes of non-coding RNA including microRNA, long intergenic RNA, small nucleolar RNA, natural antisense transcript, small nuclear RNA, and small RNA using published datasets and in-house analytic results. Altogether, we identified 635 novel protein-coding genes, 508 novel transcribed regions, 5178 non-coding RNAs, and 35 846 small RNA loci that were formerly unannotated. Analysis of the splicing events and RNA-Seq based expression profiles revealed the landscapes of gene structures, untranslated regions, and splicing activities to be more intricate than previously appreciated. Furthermore, we present 692 uniformly expressed housekeeping genes, 43% of whose human orthologs are also housekeeping genes. This updated Arabidopsis genome annotation with a substantially increased resolution of gene models will not only further our understanding of the biological processes of this plant model but also of other species.
Collapse
Affiliation(s)
- Chia-Yi Cheng
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Vivek Krishnakumar
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Agnes P Chan
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Françoise Thibaud-Nissen
- National Center for Biotechnology Information, US National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Seth Schobel
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| | - Christopher D Town
- J. Craig Venter Institute, 9714 Medical Center Drive, Rockville, MD, 20850, USA
| |
Collapse
|
23
|
Signal B, Gloss BS, Dinger ME. Computational Approaches for Functional Prediction and Characterisation of Long Noncoding RNAs. Trends Genet 2016; 32:620-637. [PMID: 27592414 DOI: 10.1016/j.tig.2016.08.004] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Revised: 08/03/2016] [Accepted: 08/04/2016] [Indexed: 02/09/2023]
Abstract
Although a considerable portion of eukaryotic genomes is transcribed as long noncoding RNAs (lncRNAs), the vast majority are functionally uncharacterised. The rapidly expanding catalogue of mechanistically investigated lncRNAs has provided evidence for distinct functional subclasses, which are now ripe for exploitation as a general model to predict functions for uncharacterised lncRNAs. By utilising publicly-available genome-wide datasets and computational methods, we present several developed and emerging in silico approaches to characterise and predict the functions of lncRNAs. We propose that the application of these techniques provides valuable functional and mechanistic insight into lncRNAs, and is a crucial step for informing subsequent functional studies.
Collapse
Affiliation(s)
- Bethany Signal
- Garvan Institute of Medical Research, Sydney, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, Australia
| | - Brian S Gloss
- Garvan Institute of Medical Research, Sydney, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, Australia
| | - Marcel E Dinger
- Garvan Institute of Medical Research, Sydney, Australia; St Vincent's Clinical School, University of New South Wales, Sydney, Australia.
| |
Collapse
|