1
|
HOPX-associated molecular programs control cardiomyocyte cell states underpinning cardiac structure and function. Dev Cell 2024; 59:91-107.e6. [PMID: 38091997 DOI: 10.1016/j.devcel.2023.11.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/26/2022] [Revised: 05/09/2023] [Accepted: 11/13/2023] [Indexed: 01/11/2024]
Abstract
Genomic regulation of cardiomyocyte differentiation is central to heart development and function. This study uses genetic loss-of-function human-induced pluripotent stem cell-derived cardiomyocytes to evaluate the genomic regulatory basis of the non-DNA-binding homeodomain protein HOPX. We show that HOPX interacts with and controls cardiac genes and enhancer networks associated with diverse aspects of heart development. Using perturbation studies in vitro, we define how upstream cell growth and proliferation control HOPX transcription to regulate cardiac gene programs. We then use cell, organoid, and zebrafish regeneration models to demonstrate that HOPX-regulated gene programs control cardiomyocyte function in development and disease. Collectively, this study mechanistically links cell signaling pathways as upstream regulators of HOPX transcription to control gene programs underpinning cardiomyocyte identity and function.
|
2
|
Vasculature organotropism in drug delivery. Adv Drug Deliv Rev 2023; 201:115054. [PMID: 37591370 PMCID: PMC10693934 DOI: 10.1016/j.addr.2023.115054] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 06/16/2023] [Revised: 07/22/2023] [Accepted: 08/13/2023] [Indexed: 08/19/2023]
Abstract
Over the past decades, there has been an exponential increase in the development of preclinical and clinical nanodelivery systems, and recently, an accelerating demand to deliver RNA and protein-based therapeutics. Organ-specific vasculature provides a promising intermediary for site-specific delivery of nanoparticles and extracellular vesicles to interstitial cells. Endothelial cells express organ-specific surface marker repertoires that can be used for targeted delivery. This article highlights organ-specific vasculature properties, nanodelivery strategies that exploit vasculature organotropism, and overlooked challenges and opportunities in targeting and simultaneously overcoming the endothelial barrier. Impediments in the clinical translation of vasculature organotropism in drug delivery are also discussed.
|
3
|
mRNA vaccine quality analysis using RNA sequencing. Nat Commun 2023; 14:5663. [PMID: 37735471 PMCID: PMC10514319 DOI: 10.1038/s41467-023-41354-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/08/2022] [Accepted: 08/24/2023] [Indexed: 09/23/2023] Open
Abstract
The success of mRNA vaccines has been realised, in part, by advances in manufacturing that enabled billions of doses to be produced at sufficient quality and safety. However, mRNA vaccines must be rigorously analysed to measure their integrity and detect contaminants that reduce their effectiveness and induce side-effects. Currently, mRNA vaccines and therapies are analysed using a range of time-consuming and costly methods. Here we describe a streamlined method to analyse mRNA vaccines and therapies using long-read nanopore sequencing. Compared to other industry-standard techniques, VAX-seq can comprehensively measure key mRNA vaccine quality attributes, including sequence, length, integrity, and purity. We also show how direct RNA sequencing can analyse mRNA chemistry, including the detection of nucleoside modifications. To support this approach, we provide supporting software to automatically report on mRNA and plasmid template quality and integrity. Given these advantages, we anticipate that RNA sequencing methods, such as VAX-seq, will become central to the development and manufacture of mRNA drugs.
|
4
|
Somatic retrotransposition in the developing rhesus macaque brain. Genome Res 2022; 32:1298-1314. [PMID: 35728967 PMCID: PMC9341517 DOI: 10.1101/gr.276451.121] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 03/02/2022] [Accepted: 06/14/2022] [Indexed: 12/03/2022]
Abstract
The retrotransposon LINE-1 (L1) is central to the recent evolutionary history of the human genome and continues to drive genetic diversity and germline pathogenesis. However, the spatiotemporal extent and biological significance of somatic L1 activity are poorly defined and are virtually unexplored in other primates. From a single L1 lineage active at the divergence of apes and Old World monkeys, successive L1 subfamilies have emerged in each descendant primate germline. As revealed by case studies, the presently active human L1 subfamily can also mobilize during embryonic and brain development in vivo. It is unknown whether nonhuman primate L1s can similarly generate somatic insertions in the brain. Here we applied approximately 40× single-cell whole-genome sequencing (scWGS), as well as retrotransposon capture sequencing (RC-seq), to 20 hippocampal neurons from two rhesus macaques (Macaca mulatta). In one animal, we detected and PCR-validated a somatic L1 insertion that generated target site duplications, carried a short 5′ transduction, and was present in ∼7% of hippocampal neurons but absent from cerebellum and nonbrain tissues. The corresponding donor L1 allele was exceptionally mobile in vitro and was embedded in PRDM4, a gene expressed throughout development and in neural stem cells. Nanopore long-read methylome and RNA-seq transcriptome analyses indicated young retrotransposon subfamily activation in the early embryo, followed by repression in adult tissues. These data highlight endogenous macaque L1 retrotransposition potential, provide prototypical evidence of L1-mediated somatic mosaicism in a nonhuman primate, and allude to L1 mobility in the brain over the past 30 million years of human evolution.
|
5
|
Methylartist: Tools for Visualising Modified Bases from Nanopore Sequence Data. Bioinformatics 2022; 38:3109-3112. [PMID: 35482479 PMCID: PMC9154218 DOI: 10.1093/bioinformatics/btac292] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 09/30/2021] [Revised: 03/11/2022] [Accepted: 04/21/2022] [Indexed: 11/12/2022] Open
Abstract
Summary: Methylartist is a consolidated suite of tools for processing, visualizing and analysing nanopore-derived modified base calls. All detectable methylation types (e.g. 5mCpG, 5hmC, 6mA) are supported, enabling integrated study of base pairs when modified naturally or as part of an experimental protocol. Availability and implementation: Methylartist is implemented in Python and is installable via PyPI and bioconda. Source code and test data are available at https://github.com/adamewing/methylartist. Supplementary information: Supplementary data are available at Bioinformatics online.
|
6
|
In vivo targeted DamID identifies CHD8 genomic targets in fetal mouse brain. iScience 2021; 24:103234. [PMID: 34746699 PMCID: PMC8551073 DOI: 10.1016/j.isci.2021.103234] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 03/02/2021] [Revised: 08/11/2021] [Accepted: 10/04/2021] [Indexed: 01/15/2023] Open
Abstract
Genetic studies of autism have revealed causal roles for chromatin remodeling gene mutations. Chromodomain helicase DNA binding protein 8 (CHD8) encodes a chromatin remodeler with significant de novo mutation rates in sporadic autism. However, relationships between CHD8 genomic function and autism-relevant biology remain poorly elucidated. Published studies utilizing ChIP-seq to map CHD8 protein-DNA interactions have high variability, consistent with technical challenges and limitations associated with this method. Thus, complementary approaches are needed to establish CHD8 genomic targets and regulatory functions in developing brain. We used in utero CHD8 Targeted DamID followed by sequencing (TaDa-seq) to characterize CHD8 binding in embryonic mouse cortex. CHD8 TaDa-seq reproduced interaction patterns observed from ChIP-seq and further highlighted CHD8 distal interactions associated with neuronal loci. This study establishes TaDa-seq as a useful alternative for mapping protein-DNA interactions in vivo and provides insights into the regulatory targets of CHD8 and autism-relevant pathophysiology associated with CHD8 mutations.
|
7
|
Processed pseudogenes: A substrate for evolutionary innovation: Retrotransposition contributes to genome evolution by propagating pseudogene sequences with rich regulatory potential throughout the genome. Bioessays 2021; 43:e2100186. [PMID: 34569081 DOI: 10.1002/bies.202100186] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 08/02/2021] [Revised: 09/09/2021] [Accepted: 09/13/2021] [Indexed: 11/08/2022]
Abstract
Processed pseudogenes may serve as a genetic reservoir for evolutionary innovation. Here, we argue that through the activity of long interspersed element-1 retrotransposons, processed pseudogenes disperse coding and noncoding sequences rich with regulatory potential throughout the human genome. While these sequences may appear to be non-functional, a lack of contemporary function does not prohibit future development of biological activity. Here, we discuss the dynamic evolution of certain processed pseudogenes into coding and noncoding genes and regulatory elements, and their implication in wide-ranging biological and pathological processes. Also see the video abstract here: https://youtu.be/iUY_mteVoPI.
|
8
|
Long-read cDNA sequencing identifies functional pseudogenes in the human transcriptome. Genome Biol 2021; 22:146. [PMID: 33971925 PMCID: PMC8108447 DOI: 10.1186/s13059-021-02369-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 10/30/2020] [Accepted: 04/28/2021] [Indexed: 01/05/2023] Open
Abstract
Pseudogenes are gene copies presumed to mainly be functionless relics of evolution due to acquired deleterious mutations or transcriptional silencing. Using deep full-length PacBio cDNA sequencing of normal human tissues and cancer cell lines, we identify here hundreds of novel transcribed pseudogenes expressed in tissue-specific patterns. Some pseudogene transcripts have intact open reading frames and are translated in cultured cells, representing unannotated protein-coding genes. To assess the biological impact of noncoding pseudogenes, we CRISPR-Cas9 delete the nucleus-enriched pseudogene PDCL3P4 and observe hundreds of perturbed genes. This study highlights pseudogenes as a complex and dynamic component of the human transcriptional landscape.
|
9
|
Microdeletion of 9q22.3: A patient with minimal deletion size associated with a severe phenotype. Am J Med Genet A 2021; 185:2070-2083. [PMID: 33960642 PMCID: PMC8251932 DOI: 10.1002/ajmg.a.62224] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/23/2020] [Revised: 03/17/2021] [Accepted: 04/02/2021] [Indexed: 01/20/2023]
Abstract
Basal cell nevus syndrome (also known as Gorlin Syndrome; MIM109400) is an autosomal dominant disorder characterized by recurrent pathological features such as basal cell carcinomas and odontogenic keratocysts as well as skeletal abnormalities. Most affected individuals have point mutations or small insertions or deletions within the PTCH1 gene on human chromosome 9, but there are some cases with more extensive deletion of the region, usually including the neighboring FANCC and/or ERCC6L2 genes. We report a 16‐year‐old patient with a deletion of approximately 400,000 bases which removes only PTCH1 and some non‐coding RNA genes but leaves FANCC and ERCC6L2 intact. In spite of the small amount of DNA for which he is haploid, his phenotype is more extreme than many individuals with longer deletions in the region. This includes early presentation with a large number of basal cell nevi and other skin lesions, multiple jaw keratocysts, and macrosomia. We found that the deletion was in the paternal chromosome, in common with other macrosomia cases. Using public databases, we have examined possible interactions between sequences within and outside the deletion and speculate that a regulatory relationship exists with flanking genes, which is unbalanced by the deletion, resulting in abnormal activation or repression of the target genes and hence the severity of the phenotype.
|
10
|
Nanopore Sequencing Enables Comprehensive Transposable Element Epigenomic Profiling. Mol Cell 2020; 80:915-928.e5. [PMID: 33186547 DOI: 10.1016/j.molcel.2020.10.024] [Citation(s) in RCA: 83] [Impact Index Per Article: 20.8] [Received: 06/29/2020] [Revised: 10/14/2020] [Accepted: 10/15/2020] [Indexed: 12/12/2022]
Abstract
Transposable elements (TEs) drive genome evolution and are a notable source of pathogenesis, including cancer. While CpG methylation regulates TE activity, the locus-specific methylation landscape of mobile human TEs has to date proven largely inaccessible. Here, we apply new computational tools and long-read nanopore sequencing to directly infer CpG methylation of novel and extant TE insertions in hippocampus, heart, and liver, as well as paired tumor and non-tumor liver. As opposed to an indiscriminate stochastic process, we find pronounced demethylation of young long interspersed element 1 (LINE-1) retrotransposons in cancer, often distinct to the adjacent genome and other TEs. SINE-VNTR-Alu (SVA) retrotransposons, including their internal tandem repeat-associated CpG island, are near-universally methylated. We encounter allele-specific TE methylation and demethylation of aberrantly expressed young LINE-1s in normal tissues. Finally, we recover the complete sequences of tumor-specific LINE-1 insertions and their retrotransposition hallmarks, demonstrating how long-read sequencing can simultaneously survey the epigenome and detect somatic TE mobilization.
|
11
|
LINE-1 Evasion of Epigenetic Repression in Humans. Mol Cell 2019; 75:590-604.e12. [PMID: 31230816 DOI: 10.1016/j.molcel.2019.05.024] [Citation(s) in RCA: 75] [Impact Index Per Article: 15.0] [Received: 10/11/2018] [Revised: 04/08/2019] [Accepted: 05/15/2019] [Indexed: 02/07/2023]
Abstract
Epigenetic silencing defends against LINE-1 (L1) retrotransposition in mammalian cells. However, the mechanisms that repress young L1 families and how L1 escapes to cause somatic genome mosaicism in the brain remain unclear. Here we report that a conserved Yin Yang 1 (YY1) transcription factor binding site mediates L1 promoter DNA methylation in pluripotent and differentiated cells. By analyzing 24 hippocampal neurons with three distinct single-cell genomic approaches, we characterized and validated a somatic L1 insertion bearing a 3' transduction. The source (donor) L1 for this insertion was slightly 5' truncated, lacked the YY1 binding site, and was highly mobile when tested in vitro. Locus-specific bisulfite sequencing revealed that the donor L1 and other young L1s with mutated YY1 binding sites were hypomethylated in embryonic stem cells, during neurodifferentiation, and in liver and brain tissue. These results explain how L1 can evade repression and retrotranspose in the human body.
|
12
|
DamID as a versatile tool for understanding gene regulation. Development 2019; 146:dev173666. [PMID: 30877125 PMCID: PMC6451315 DOI: 10.1242/dev.173666] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Received: 12/14/2018] [Accepted: 02/18/2019] [Indexed: 12/22/2022]
Abstract
The interaction of proteins and RNA with chromatin underlies the regulation of gene expression. The ability to easily profile these interactions is fundamental for understanding chromatin biology in vivo. DNA adenine methyltransferase identification (DamID) profiles genome-wide protein-DNA interactions without antibodies, fixation or protein pull-downs. Recently, DamID has been adapted for applications beyond simple assaying of protein-DNA interactions, such as for studying RNA-chromatin interactions, chromatin accessibility and long-range chromosome interactions. Here, we provide an overview of DamID and introduce improvements to the technology, discuss their applications and compare alternative methodologies. Summary: This Primer provides an overview of DNA adenine methyltransferase identification (DamID), which is used to profile genome-wide chromatin interactions, and introduces recent improvements to the technology.
|
13
|
Targeted DamID reveals differential binding of mammalian pluripotency factors. Development 2018; 145:dev.170209. [PMID: 30185410 DOI: 10.1242/dev.170209] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Received: 07/16/2018] [Accepted: 08/23/2018] [Indexed: 12/14/2022]
Abstract
The precise control of gene expression by transcription factor networks is crucial to organismal development. The predominant approach for mapping transcription factor-chromatin interactions has been chromatin immunoprecipitation (ChIP). However, ChIP requires a large number of homogeneous cells and antisera with high specificity. A second approach, DamID, has the drawback that high levels of Dam methylase are toxic. Here, we modify our targeted DamID approach (TaDa) to enable cell type-specific expression in mammalian systems, generating an inducible system (mammalian TaDa or MaTaDa) to identify genome-wide protein/DNA interactions in 100 to 1000 times fewer cells than ChIP-based approaches. We mapped the binding sites of two key pluripotency factors, OCT4 and PRDM14, in mouse embryonic stem cells, epiblast-like cells and primordial germ cell-like cells (PGCLCs). PGCLCs are an important system for elucidating primordial germ cell development in mice. We monitored PRDM14 binding during the specification of PGCLCs, identifying direct targets of PRDM14 that are key to understanding its crucial role in PGCLC development. We show that MaTaDa is a sensitive and accurate method for assessing cell type-specific transcription factor binding in limited numbers of cells.
|
14
|
RNA-DamID reveals cell-type-specific binding of roX RNAs at chromatin-entry sites. Nat Struct Mol Biol 2017; 25:109-114. [PMID: 29323275 PMCID: PMC5813796 DOI: 10.1038/s41594-017-0006-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Received: 08/29/2017] [Accepted: 11/09/2017] [Indexed: 02/08/2023]
Abstract
Thousands of long noncoding RNAs (lncRNAs) have been identified in eukaryotic genomes, many of which are expressed in spatially and temporally restricted patterns. Nonetheless, the roles of the majority of these transcripts are still unknown. One of the mechanisms by which lncRNAs function is through the modulation of chromatin state. To assess the functions of lncRNAs we developed RNA-DamID, a novel approach that detects lncRNA-genome interactions in a cell-type specific manner in vivo with high sensitivity and accuracy. Identifying the cell-type-specific genome occupancy of lncRNAs is key to understanding their mechanisms of action in development and disease. We used RNA-DamID to investigate targeting of the lncRNAs in the Drosophila dosage compensation complex (DCC) and show that initial targeting is cell-type-specific.
|
15
|
Cell-type-specific profiling of protein-DNA interactions without cell isolation using targeted DamID with next-generation sequencing. Nat Protoc 2016; 11:1586-98. [PMID: 27490632 DOI: 10.1038/nprot.2016.084] [Citation(s) in RCA: 73] [Impact Index Per Article: 9.1] [Indexed: 12/20/2022]
Abstract
This protocol is an extension to: Nat. Protoc. 2, 1467-1478 (2007); doi:10.1038/nprot.2007.148; published online 7 June 2007. The ability to profile transcription and chromatin binding in a cell-type-specific manner is a powerful aid to understanding cell-fate specification and cellular function in multicellular organisms. We recently developed targeted DamID (TaDa) to enable genome-wide, cell-type-specific profiling of DNA- and chromatin-binding proteins in vivo without cell isolation. As a protocol extension, this article describes substantial modifications to an existing protocol, and it offers additional applications. TaDa builds upon DamID, a technique for detecting genome-wide DNA-binding profiles of proteins, by coupling it with the GAL4 system in Drosophila to enable both temporal and spatial resolution. TaDa ensures that Dam-fusion proteins are expressed at very low levels, thus avoiding toxicity and potential artifacts from overexpression. The modifications to the core DamID technique presented here also increase the speed of sample processing and throughput, and adapt the method to next-generation sequencing technology. TaDa is robust, reproducible and highly sensitive. Compared with other methods for cell-type-specific profiling, the technique requires no cell-sorting, cross-linking or antisera, and binding profiles can be generated from as few as 10,000 total induced cells. By profiling the genome-wide binding of RNA polymerase II (Pol II), TaDa can also identify transcribed genes in a cell-type-specific manner. Here we describe a detailed protocol for carrying out TaDa experiments and preparing the material for next-generation sequencing. Although we developed TaDa in Drosophila, it should be easily adapted to other organisms with an inducible expression system. Once transgenic animals are obtained, the entire experimental procedure, from collecting tissue samples to generating sequencing libraries, can be accomplished within 5 d.
|
16
|
The Evx1/Evx1as gene locus regulates anterior-posterior patterning during gastrulation. Sci Rep 2016; 6:26657. [PMID: 27226347 PMCID: PMC4880930 DOI: 10.1038/srep26657] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Received: 01/11/2016] [Accepted: 04/29/2016] [Indexed: 01/09/2023] Open
Abstract
Thousands of sense-antisense mRNA-lncRNA gene pairs occur in the mammalian genome. While there is usually little doubt about the function of the coding transcript, the function of the lncRNA partner is mostly untested. Here we examine the function of the homeotic Evx1-Evx1as gene locus. Expression is tightly co-regulated in posterior mesoderm of mouse embryos and in embryoid bodies. Expression of both genes is enhanced by BMP4 and WNT3A, and reduced by Activin. We generated a suite of deletions in the locus by CRISPR-Cas9 editing. We show EVX1 is a critical downstream effector of BMP4 and WNT3A with respect to patterning of posterior mesoderm. The lncRNA Evx1as arises from alternative promoters and is difficult to fully abrogate by gene editing or siRNA approaches. Nevertheless, we were able to generate a large 2.6 kb deletion encompassing the shared promoter with Evx1 and multiple additional exons of Evx1as. This led to an identical dorsal-ventral patterning defect to that generated by micro-deletion in the DNA-binding domain of EVX1. Thus, Evx1as has no function independent of EVX1, and is therefore unlikely to act in trans. We predict many antisense lncRNAs have no specific trans function, possibly only regulating the linked coding genes in cis.
|
17
|
Freedom of expression: cell-type-specific gene profiling. Wiley Interdiscip Rev Dev Biol 2014; 3:429-43. [PMID: 25174322 DOI: 10.1002/wdev.149] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Received: 06/16/2014] [Accepted: 07/10/2014] [Indexed: 12/17/2022]
Abstract
Cell fate and behavior are results of differential gene regulation, making techniques to profile gene expression in specific cell types highly desirable. Many methods now enable investigation at the DNA, RNA and protein level. This review introduces the most recent and popular techniques, and discusses key issues influencing the choice between these such as ease, cost and applicability of information gained. Interdisciplinary collaborations will no doubt contribute further advances, including not just in single cell type but single-cell expression profiling.
|
18
|
Long noncoding RNAs and the genetics of cancer. Br J Cancer 2013; 108:2419-25. [PMID: 23660942 PMCID: PMC3694235 DOI: 10.1038/bjc.2013.233] [Citation(s) in RCA: 596] [Impact Index Per Article: 54.2] [Received: 01/17/2013] [Revised: 04/03/2013] [Accepted: 04/11/2013] [Indexed: 02/07/2023] Open
Abstract
Cancer is a disease of aberrant gene expression. While the genetic causes of cancer have been intensively studied, it is becoming evident that a large proportion of cancer susceptibility cannot be attributed to variation in protein-coding sequences. This is highlighted by genome-wide association studies in cancer that reveal that more than 80% of cancer-associated SNPs occur in noncoding regions of the genome. In this review, we posit that a significant fraction of the genetic aetiology of cancer is exacted by noncoding regulatory sequences, particularly by long noncoding RNAs (lncRNAs). Recent studies indicate that several cancer risk loci are transcribed into lncRNAs and these transcripts play key roles in tumorigenesis. We discuss the epigenetic and other mechanisms through which lncRNAs function and how they contribute to each stage of cancer progression, understanding of which will be crucial for realising new opportunities in cancer diagnosis and treatment. Long noncoding RNAs play important roles in almost every aspect of cell biology from nuclear organisation and epigenetic regulation to post-transcriptional regulation and splicing, and we link these processes to the hallmarks and genetics of cancer. Finally, we highlight recent progress and future potential in the application of lncRNAs as therapeutic targets and diagnostic markers.
|
19
|
|
20
|
Pinstripe: a suite of programs for integrating transcriptomic and proteomic datasets identifies novel proteins and improves differentiation of protein-coding and non-coding genes. Bioinformatics 2012; 28:3042-50. [PMID: 23044541 DOI: 10.1093/bioinformatics/bts582] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION: Comparing transcriptomic data with proteomic data to identify protein-coding sequences is a long-standing challenge in molecular biology, one that is exacerbated by the increasing size of high-throughput datasets. To address this challenge, and thereby to improve the quality of genome annotation and understanding of genome biology, we have developed an integrated suite of programs, called Pinstripe. We demonstrate its application, utility and discovery power using transcriptomic and proteomic data from publicly available datasets. RESULTS: To demonstrate the efficacy of Pinstripe for large-scale analysis, we applied Pinstripe's reverse peptide mapping pipeline to a transcript library including de novo assembled transcriptomes from the human Illumina Body Atlas (IBA2) and GENCODE v10 gene annotations, and the EBI Proteomics Identifications Database (PRIDE) peptide database. This analysis identified 736 canonical open reading frames (ORFs) supported by three or more PRIDE peptide fragments that are positioned outside any known coding DNA sequence (CDS). Because of the unfiltered nature of the PRIDE database and high probability of false discovery, we further refined this list using independent evidence for translation, including the presence of a Kozak sequence or functional domains, synonymous/non-synonymous substitution ratios and ORF length. Using this integrative approach, we observed evidence of translation from a previously unknown let7e primary transcript, the archetypical lncRNA H19, and a homolog of RD3. Reciprocally, by exclusion of transcripts with mapped peptides or significant ORFs (>80 codons), we identify 32 187 loci with RNAs longer than 2000 nt that are unlikely to encode proteins. AVAILABILITY AND IMPLEMENTATION: Pinstripe (pinstripe.matticklab.com) is freely available as source code or a Mono binary. Pinstripe is written in C# and runs under the Mono framework on Linux or Mac OS X, and both under Mono and .Net under Windows.
CONTACT: m.dinger@garvan.org.au or j.mattick@garvan.org.au SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
|