1
|
Ueda MT, Inamo J, Miya F, Shimada M, Yamaguchi K, Kochi Y. Functional and dynamic profiling of transcript isoforms reveals essential roles of alternative splicing in interferon response. CELL GENOMICS 2024; 4:100654. [PMID: 39288763 DOI: 10.1016/j.xgen.2024.100654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 04/04/2024] [Accepted: 08/20/2024] [Indexed: 09/19/2024]
Abstract
Type I interferon (IFN-I) plays an important role in the innate immune response through inducing IFN-I-stimulated genes (ISGs). However, how alternative splicing (AS) events, especially over time, affect their function remains poorly understood. We generated an annotation (113,843 transcripts) for IFN-I-stimulated human B cells called isoISG using high-accuracy long-read sequencing data from PacBio Sequel II/IIe. Transcript isoform profiling using isoISG revealed that isoform switching occurred in the early response to IFN-I so that ISGs would gain functional domains (e.g., C4B) or higher protein production (e.g., IRF3). Conversely, isoforms lacking functional domains increased during the late phase of IFN-I response, mainly due to intron retention events. This suggests that isoform switching both triggers and terminates IFN-I responses at the translation and protein levels. Furthermore, genetic variants influencing the isoform ratio of ISGs were associated with immunological and infectious diseases. AS has essential roles in regulating innate immune response and associated diseases.
Collapse
Affiliation(s)
- Mahoko Takahashi Ueda
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan
| | - Jun Inamo
- Division of Rheumatology, University of Colorado School of Medicine, Aurora, CO, USA; Department of Biomedical Informatics, Center for Health Artificial Intelligence, University of Colorado School of Medicine, Aurora, CO, USA
| | - Fuyuki Miya
- Center for Medical Genetics, Keio University School of Medicine, Tokyo 160-8582, Japan
| | - Mihoko Shimada
- National Center for Global Health and Medicine, Tokyo 162-8655, Japan
| | - Kensuke Yamaguchi
- Biomedical Engineering Research Innovation Center, Institute of Biomaterials and Bioengineering, Tokyo Medical and Dental University, Tokyo 113-8510, Japan; Laboratory for Autoimmune Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, Japan; Department of Allergy and Rheumatology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
| | - Yuta Kochi
- Department of Genomic Function and Diversity, Medical Research Institute, Tokyo Medical and Dental University, Tokyo 113-8510, Japan; Department of Allergy and Rheumatology, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
2
|
Furci L, Berthelier J, Saze H. RNA N6-adenine methylation dynamics impact Hyaloperonospora arabidopsidis resistance in Arabidopsis. PLANT PHYSIOLOGY 2024; 196:745-753. [PMID: 38991559 DOI: 10.1093/plphys/kiae373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 06/03/2024] [Accepted: 06/16/2024] [Indexed: 07/13/2024]
Abstract
In plants, epitranscriptomic mark N6-adenine methylation (m6A) is dynamically regulated in response to environmental cues. However, little is known about m6A dynamics under biotic stresses and their role in environmental adaptation. Additionally, current methodologies limit the investigation of m6A dynamics at single-nucleotide resolution on specific RNA molecules. Using Oxford Nanopore Technology direct RNA sequencing and a neural network model, we show transcript-specific dynamics of m6A modification at single-nucleotide resolution during Hyaloperonospora arabidopsidis (Hpa) infection in Arabidopsis (Arabidopsis thaliana). In wild-type seedlings, pathogen infection causes a significant reduction in global m6A ratios, which corresponds with the activation of m6A-modified transcripts. Defect of m6A deposition in the m6A mutant hakai-1 mimics m6A reduction from Hpa infection at ∼70% of sites, resulting in constitutive overexpression of basal defense genes and enhanced resistance against the pathogen. Our results demonstrate that m6A dynamics impact defense response against Hpa, providing a promising target for future crop improvement strategies.
Collapse
Affiliation(s)
- Leonardo Furci
- Plant Epigenetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0412, Japan
| | - Jérémy Berthelier
- Plant Epigenetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0412, Japan
| | - Hidetoshi Saze
- Plant Epigenetics Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa 904-0412, Japan
| |
Collapse
|
3
|
Zhang Y, Zhang T, Yang G, Pan Z, Tang M, Wen Y, He P, Wang Y, Zhou R. PerturbAtlas: a comprehensive atlas of public genetic perturbation bulk RNA-seq datasets. Nucleic Acids Res 2024:gkae851. [PMID: 39351872 DOI: 10.1093/nar/gkae851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 09/12/2024] [Accepted: 09/17/2024] [Indexed: 10/03/2024] Open
Abstract
Manipulating gene expression is crucial for understanding gene function, with high-throughput sequencing techniques such as RNA-seq elucidating the downstream mechanisms involved. However, the lack of a standardized metadata format for small-scale perturbation expression datasets in public repositories hinders their reuse. To address this issue, we developed PerturbAtlas, an add-value resource that re-analyzes publicly archived RNA-seq libraries to provide quantitative data on gene expression, transcript profiles, and alternative splicing events following genetic perturbation. PerturbAtlas assists users in identifying trends at the gene and isoform levels in perturbation assays by re-analyzing a curated set of 122 801 RNA-seq libraries across 13 species. This resource is freely available at https://perturbatlas.kratoss.site as both raw data tables and an interactive browser, allowing searches by species, tissue or genomic features. The results provide detailed information on alterations following perturbations, accessible through both forward and reverse approaches, thereby enabling the exploration of perturbation consequences and the identification of potential causal perturbations.
Collapse
Affiliation(s)
- Yiming Zhang
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
- Institute of Thoracic Oncology and Department of Thoracic Surgery, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Ting Zhang
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Gaoxia Yang
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Zhenzhong Pan
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Min Tang
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Yue Wen
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Ping He
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Yuan Wang
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Ran Zhou
- Department of Neurosurgery and State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| |
Collapse
|
4
|
Tiberi S, Meili J, Cai P, Soneson C, He D, Sarkar H, Avalos-Pacheco A, Patro R, Robinson MD. DifferentialRegulation: a Bayesian hierarchical approach to identify differentially regulated genes. Biostatistics 2024; 25:1079-1093. [PMID: 38887902 DOI: 10.1093/biostatistics/kxae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 03/21/2024] [Accepted: 05/15/2024] [Indexed: 06/20/2024] Open
Abstract
Although transcriptomics data is typically used to analyze mature spliced mRNA, recent attention has focused on jointly investigating spliced and unspliced (or precursor-) mRNA, which can be used to study gene regulation and changes in gene expression production. Nonetheless, most methods for spliced/unspliced inference (such as RNA velocity tools) focus on individual samples, and rarely allow comparisons between groups of samples (e.g. healthy vs. diseased). Furthermore, this kind of inference is challenging, because spliced and unspliced mRNA abundance is characterized by a high degree of quantification uncertainty, due to the prevalence of multi-mapping reads, ie reads compatible with multiple transcripts (or genes), and/or with both their spliced and unspliced versions. Here, we present DifferentialRegulation, a Bayesian hierarchical method to discover changes between experimental conditions with respect to the relative abundance of unspliced mRNA (over the total mRNA). We model the quantification uncertainty via a latent variable approach, where reads are allocated to their gene/transcript of origin, and to the respective splice version. We designed several benchmarks where our approach shows good performance, in terms of sensitivity and error control, vs. state-of-the-art competitors. Importantly, our tool is flexible, and works with both bulk and single-cell RNA-sequencing data. DifferentialRegulation is distributed as a Bioconductor R package.
Collapse
Affiliation(s)
- Simone Tiberi
- Department of Statistical Sciences, University of Bologna, Via delle Belle Arti 41, Bologna, 40126, Italy
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Winterthurerstrasse 190, Zurich, 8057, Switzerland
| | - Joël Meili
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Winterthurerstrasse 190, Zurich, 8057, Switzerland
| | - Peiying Cai
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Winterthurerstrasse 190, Zurich, 8057, Switzerland
| | - Charlotte Soneson
- Computational Biology Platform, Friedrich Miescher Institute for Biomedical Research and SIB Swiss Institute of Bioinformatics, Fabrikstrasse 24, Basel, 4056, Switzerland
| | - Dongze He
- Department of Cell Biology and Molecular Genetics, University of Maryland, 4062 Campus Drive, College Park, MD 20742, United States
- Center for Bioinformatics and Computational Biology, University of Maryland, 8125 Paint Branch Dr, College Park, MD 20742, United States
| | - Hirak Sarkar
- Department of Computer Science, Princeton University, 35 Olden St, Princeton, NJ 08540, United States
| | - Alejandra Avalos-Pacheco
- Research Unit of Applied Statistics, TU Wien, Wiedner Hauptstrabe 8-10/105, Wien 1040, Austria
- Harvard-MIT Center for Regulatory Science, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115200 Longwood Avenue, Boston, MA 02115, United States
| | - Rob Patro
- Center for Bioinformatics and Computational Biology, University of Maryland, 8125 Paint Branch Dr, College Park, MD 20742, United States
- Department of Computer Science, University of Maryland, 8125 Paint Branch Dr, College Park, MD 20742, United States
| | - Mark D Robinson
- Department of Molecular Life Sciences and SIB Swiss Institute of Bioinformatics, University of Zurich, Winterthurerstrasse 190, Zurich, 8057, Switzerland
| |
Collapse
|
5
|
Wang D, Gazzara MR, Jewell S, Wales-McGrath B, Brown CD, Choi PS, Barash Y. A Deep Dive into Statistical Modeling of RNA Splicing QTLs Reveals New Variants that Explain Neurodegenerative Disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.01.610696. [PMID: 39282456 PMCID: PMC11398334 DOI: 10.1101/2024.09.01.610696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
Genome-wide association studies (GWAS) have identified thousands of putative disease causing variants with unknown regulatory effects. Efforts to connect these variants with splicing quantitative trait loci (sQTLs) have provided functional insights, yet sQTLs reported by existing methods cannot explain many GWAS signals. We show current sQTL modeling approaches can be improved by considering alternative splicing representation, model calibration, and covariate integration. We then introduce MAJIQTL, a new pipeline for sQTL discovery. MAJIQTL includes two new statistical methods: a weighted multiple testing approach for sGene discovery and a model for sQTL effect size inference to improve variant prioritization. By applying MAJIQTL to GTEx, we find significantly more sGenes harboring sQTLs with functional significance. Notably, our analysis implicates the novel variant rs582283 in Alzheimer's disease. Using antisense oligonucleotides, we validate this variant's effect by blocking the implicated YBX3 binding site, leading to exon skipping in the gene MS4A3.
Collapse
Affiliation(s)
- David Wang
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania
| | - Matthew R. Gazzara
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania
| | - San Jewell
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania
| | | | | | - Peter S. Choi
- Department of Pathology & Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania
- Division of Cancer Pathobiology, The Children’s Hospital of Philadelphia
| | - Yoseph Barash
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania
- Department of Computer and Information Sciences, School of Engineering, University of Pennsylvania
| |
Collapse
|
6
|
De Rijk P, Watzeels T, Küçükali F, Van Dongen J, Faura J, Willems P, De Deyn L, Duchateau L, Grones C, Eekhout T, De Pooter T, Joris G, Rombauts S, De Rybel B, Rademakers R, Van Breusegem F, Strazisar M, Sleegers K, De Coster W. Scywalker: scalable end-to-end data analysis workflow for long-read single-cell transcriptome sequencing. Bioinformatics 2024; 40:btae549. [PMID: 39254601 PMCID: PMC11419950 DOI: 10.1093/bioinformatics/btae549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 08/29/2024] [Accepted: 09/06/2024] [Indexed: 09/11/2024] Open
Abstract
MOTIVATION Existing nanopore single-cell data analysis tools showed severe limitations in handling current data sizes. RESULTS We introduce scywalker, an innovative and scalable package developed to comprehensively analyze long-read sequencing data of full-length single-cell or single-nuclei cDNA. We developed novel scalable methods for cell barcode demultiplexing and single-cell isoform calling and quantification and incorporated these in an easily deployable package. Scywalker streamlines the entire analysis process, from sequenced fragments in FASTQ format to demultiplexed pseudobulk isoform counts, into a single command suitable for execution on either server or cluster. Scywalker includes data quality control, cell type identification, and an interactive report. Assessment of datasets from the human brain, Arabidopsis leaves, and previously benchmarked data from mixed cell lines demonstrate excellent correlation with short-read analyses at both the cell-barcoding and gene quantification levels. At the isoform level, we show that scywalker facilitates the direct identification of cell-type-specific expression of novel isoforms. AVAILABILITY AND IMPLEMENTATION Scywalker is available on github.com/derijkp/scywalker under the GNU General Public License (GPL) and at https://zenodo.org/records/13359438/files/scywalker-0.108.0-Linux-x86_64.tar.gz.
Collapse
Affiliation(s)
- Peter De Rijk
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Universiteitsplein 1, Antwerp, 2610, Belgium
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
| | - Tijs Watzeels
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Complex Genetics of Alzheimer’s Disease Group, VIB Center for Molecular Neurology, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Fahri Küçükali
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Complex Genetics of Alzheimer’s Disease Group, VIB Center for Molecular Neurology, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Jasper Van Dongen
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Complex Genetics of Alzheimer’s Disease Group, VIB Center for Molecular Neurology, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Júlia Faura
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Applied and Translational Neurogenomics Group, VIB Center for Molecular Neurology, Universiteitsplein 1, Antwerp, 2610, Belgium
| | - Patrick Willems
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Center for Plant Systems Biology, VIB, Technologiepark 71, Zwijnaarde, 9052, Belgium
- Department of Biomolecular Medicine, Ghent University, Corneel Heymanslaan 10, Ghent, 9000, Belgium
- VIB Center for Medical Biotechnology, VIB, Technologiepark-Zwijnaarde 75, Ghent, 9052, Belgium
| | - Lara De Deyn
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Complex Genetics of Alzheimer’s Disease Group, VIB Center for Molecular Neurology, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Lena Duchateau
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Complex Genetics of Alzheimer’s Disease Group, VIB Center for Molecular Neurology, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Carolin Grones
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Center for Plant Systems Biology, VIB, Technologiepark 71, Zwijnaarde, 9052, Belgium
| | - Thomas Eekhout
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Center for Plant Systems Biology, VIB, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Single Cell Core, VIB, Technologiepark-Zwijnaarde 71, Ghent, 9052, Belgium
| | - Tim De Pooter
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Universiteitsplein 1, Antwerp, 2610, Belgium
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
| | - Geert Joris
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Universiteitsplein 1, Antwerp, 2610, Belgium
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
| | - Stephane Rombauts
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Center for Plant Systems Biology, VIB, Technologiepark 71, Zwijnaarde, 9052, Belgium
| | - Bert De Rybel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Center for Plant Systems Biology, VIB, Technologiepark 71, Zwijnaarde, 9052, Belgium
| | - Rosa Rademakers
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Applied and Translational Neurogenomics Group, VIB Center for Molecular Neurology, Universiteitsplein 1, Antwerp, 2610, Belgium
| | - Frank Van Breusegem
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Technologiepark 71, Zwijnaarde, 9052, Belgium
- VIB Center for Plant Systems Biology, VIB, Technologiepark 71, Zwijnaarde, 9052, Belgium
| | - Mojca Strazisar
- Neuromics Support Facility, VIB Center for Molecular Neurology, VIB, Universiteitsplein 1, Antwerp, 2610, Belgium
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
| | - Kristel Sleegers
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Complex Genetics of Alzheimer’s Disease Group, VIB Center for Molecular Neurology, Universiteitsplein 1, 2610, Antwerp, Belgium
| | - Wouter De Coster
- Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Antwerp, 2610, Belgium
- Applied and Translational Neurogenomics Group, VIB Center for Molecular Neurology, Universiteitsplein 1, Antwerp, 2610, Belgium
| |
Collapse
|
7
|
Kiseleva OI, Arzumanian VA, Kurbatov IY, Poverennaya EV. In silico and in cellulo approaches for functional annotation of human protein splice variants. BIOMEDITSINSKAIA KHIMIIA 2024; 70:315-328. [PMID: 39324196 DOI: 10.18097/pbmc20247005315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2024]
Abstract
The elegance of pre-mRNA splicing mechanisms continues to interest scientists even after over a half century, since the discovery of the fact that coding regions in genes are interrupted by non-coding sequences. The vast majority of human genes have several mRNA variants, coding structurally and functionally different protein isoforms in a tissue-specific manner and with a linkage to specific developmental stages of the organism. Alteration of splicing patterns shifts the balance of functionally distinct proteins in living systems, distorts normal molecular pathways, and may trigger the onset and progression of various pathologies. Over the past two decades, numerous studies have been conducted in various life sciences disciplines to deepen our understanding of splicing mechanisms and the extent of their impact on the functioning of living systems. This review aims to summarize experimental and computational approaches used to elucidate the functions of splice variants of a single gene based on our experience accumulated in the laboratory of interactomics of proteoforms at the Institute of Biomedical Chemistry (IBMC) and best global practices.
Collapse
Affiliation(s)
- O I Kiseleva
- Institute of Biomedical Chemistry, Moscow, Russia
| | | | | | | |
Collapse
|
8
|
Choquet K, Chaumont LP, Bache S, Baxter-Koenigs AR, Churchman LS. Genetic regulation of nascent RNA maturation revealed by direct RNA nanopore sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.29.610338. [PMID: 39257732 PMCID: PMC11383983 DOI: 10.1101/2024.08.29.610338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Quantitative trait loci analyses have revealed an important role for genetic variants in regulating alternative splicing (AS) and alternative cleavage and polyadenylation (APA) in humans. Yet, these studies are generally performed with mature mRNA, so they report on the outcome rather than the processes of RNA maturation and thus may overlook how variants directly modulate pre-mRNA processing. The order in which the many introns of a human gene are removed can substantially influence AS, while nascent RNA polyadenylation can affect RNA stability and decay. However, how splicing order and poly(A) tail length are regulated by genetic variation has never been explored. Here, we used direct RNA nanopore sequencing to investigate allele-specific pre-mRNA maturation in 12 human lymphoblastoid cell lines. We found frequent splicing order differences between alleles and uncovered significant single nucleotide polymorphism (SNP)-splicing order associations in 17 genes. This included SNPs located in or near splice sites as well as more distal intronic and exonic SNPs. Moreover, several genes showed allele-specific poly(A) tail lengths, many of which also had a skewed allelic abundance ratio. HLA class I transcripts, which encode proteins that play an essential role in antigen presentation, showed the most allele-specific splicing orders, which frequently co-occurred with allele-specific AS, APA or poly(A) tail length differences. Together, our results expose new layers of genetic regulation of pre-mRNA maturation and highlight the power of long-read RNA sequencing for allele-specific analyses.
Collapse
Affiliation(s)
- Karine Choquet
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Canada
- Research Centre on Aging, CIUSSS de l’Estrie-CHUS, Sherbrooke, Canada
| | - Louis-Philippe Chaumont
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Canada
- Research Centre on Aging, CIUSSS de l’Estrie-CHUS, Sherbrooke, Canada
| | - Simon Bache
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, Canada
- Research Centre on Aging, CIUSSS de l’Estrie-CHUS, Sherbrooke, Canada
| | | | | |
Collapse
|
9
|
Mehta P, Liu CSC, Sinha S, Mohite R, Arora S, Chattopadhyay P, Budhiraja S, Tarai B, Pandey R. Reduced protein-coding transcript diversity in severe dengue emphasises the role of alternative splicing. Life Sci Alliance 2024; 7:e202402683. [PMID: 38830771 PMCID: PMC11147948 DOI: 10.26508/lsa.202402683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 05/22/2024] [Accepted: 05/22/2024] [Indexed: 06/05/2024] Open
Abstract
Dengue fever, a neglected tropical arboviral disease, has emerged as a global health concern in the past decade. Necessitating a nuanced comprehension of the intricate dynamics of host-virus interactions influencing disease severity, we analysed transcriptomic patterns using bulk RNA-seq from 112 age- and gender-matched NS1 antigen-confirmed hospital-admitted dengue patients with varying severity. Severe cases exhibited reduced platelet count, increased lymphocytosis, and neutropenia, indicating a dysregulated immune response. Using bulk RNA-seq, our analysis revealed a minimal overlap between the differentially expressed gene and transcript isoform, with a distinct expression pattern across the disease severity. Severe patients showed enrichment in retained intron and nonsense-mediated decay transcript biotypes, suggesting altered splicing efficiency. Furthermore, an up-regulated programmed cell death, a haemolytic response, and an impaired interferon and antiviral response at the transcript level were observed. We also identified the potential involvement of the RBM39 gene among others in the innate immune response during dengue viral pathogenesis, warranting further investigation. These findings provide valuable insights into potential therapeutic targets, underscoring the importance of exploring transcriptomic landscapes between different disease sub-phenotypes in infectious diseases.
Collapse
Affiliation(s)
- Priyanka Mehta
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Chinky Shiu Chen Liu
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Sristi Sinha
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Ramakant Mohite
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Smriti Arora
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
| | - Partha Chattopadhyay
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Sandeep Budhiraja
- https://ror.org/00e7r7m66 Max Super Speciality Hospital (A Unit of Devki Devi Foundation), Max Healthcare, Delhi, India
| | - Bansidhar Tarai
- https://ror.org/00e7r7m66 Max Super Speciality Hospital (A Unit of Devki Devi Foundation), Max Healthcare, Delhi, India
| | - Rajesh Pandey
- https://ror.org/05ef28661 Division of Immunology and Infectious Disease Biology, INtegrative GENomics of HOst-PathogEn (INGEN-HOPE) Laboratory, CSIR-Institute of Genomics and Integrative Biology (CSIR-IGIB), Delhi, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
10
|
Occean JR, Yang N, Sun Y, Dawkins MS, Munk R, Belair C, Dar S, Anerillas C, Wang L, Shi C, Dunn C, Bernier M, Price NL, Kim JS, Cui CY, Fan J, Bhattacharyya M, De S, Maragkakis M, de Cabo R, Sidoli S, Sen P. Gene body DNA hydroxymethylation restricts the magnitude of transcriptional changes during aging. Nat Commun 2024; 15:6357. [PMID: 39069555 PMCID: PMC11284234 DOI: 10.1038/s41467-024-50725-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 07/15/2024] [Indexed: 07/30/2024] Open
Abstract
DNA hydroxymethylation (5hmC), the most abundant oxidative derivative of DNA methylation, is typically enriched at enhancers and gene bodies of transcriptionally active and tissue-specific genes. Although aberrant genomic 5hmC has been implicated in age-related diseases, its functional role in aging remains unknown. Here, using mouse liver and cerebellum as model organs, we show that 5hmC accumulates in gene bodies associated with tissue-specific function and restricts the magnitude of gene expression changes with age. Mechanistically, 5hmC decreases the binding of splicing associated factors and correlates with age-related alternative splicing events. We found that various age-related contexts, such as prolonged quiescence and senescence, drive the accumulation of 5hmC with age. We provide evidence that this age-related transcriptionally restrictive function is conserved in mouse and human tissues. Our findings reveal that 5hmC regulates tissue-specific function and may play a role in longevity.
Collapse
Affiliation(s)
- James R Occean
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Na Yang
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Yan Sun
- Department of Biochemistry, Albert Einstein School of Medicine, Bronx, NY, USA
| | - Marshall S Dawkins
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Rachel Munk
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Cedric Belair
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Showkat Dar
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Carlos Anerillas
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Lin Wang
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Changyou Shi
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Christopher Dunn
- Flow Cytometry Unit, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Michel Bernier
- Translational Gerontology Branch, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Nathan L Price
- Translational Gerontology Branch, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Julie S Kim
- Department of Biochemistry, Albert Einstein School of Medicine, Bronx, NY, USA
| | - Chang-Yi Cui
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Jinshui Fan
- Computational Biology and Genomics Core, Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | | | - Supriyo De
- Computational Biology and Genomics Core, Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Manolis Maragkakis
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Rafael de Cabo
- Translational Gerontology Branch, National Institute on Aging, NIH, Baltimore, MD, USA
| | - Simone Sidoli
- Department of Biochemistry, Albert Einstein School of Medicine, Bronx, NY, USA
| | - Payel Sen
- Laboratory of Genetics and Genomics, National Institute on Aging, NIH, Baltimore, MD, USA.
| |
Collapse
|
11
|
Occean JR, Yang N, Sun Y, Dawkins MS, Munk R, Belair C, Dar S, Anerillas C, Wang L, Shi C, Dunn C, Bernier M, Price NL, Kim JS, Cui CY, Fan J, Bhattacharyya M, De S, Maragkakis M, deCabo R, Sidoli S, Sen P. Gene body DNA hydroxymethylation restricts the magnitude of transcriptional changes during aging. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.15.528714. [PMID: 36824863 PMCID: PMC9949049 DOI: 10.1101/2023.02.15.528714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
DNA hydroxymethylation (5hmC), the most abundant oxidative derivative of DNA methylation, is typically enriched at enhancers and gene bodies of transcriptionally active and tissue-specific genes. Although aberrant genomic 5hmC has been implicated in age-related diseases, its functional role in aging remains unknown. Here, using mouse liver and cerebellum as model organs, we show that 5hmC accumulates in gene bodies associated with tissue-specific function and restricts the magnitude of gene expression changes with age. Mechanistically, 5hmC decreases the binding of splicing associated factors and correlates with age-related alternative splicing events. We found that various age-related contexts, such as prolonged quiescence and senescence, drive the accumulation of 5hmC with age. We provide evidence that this age-related transcriptionally restrictive function is conserved in mouse and human tissues. Our findings reveal that 5hmC regulates tissue-specific function and may play a role in longevity.
Collapse
|
12
|
Tang AD, Felton C, Hrabeta-Robinson E, Volden R, Vollmers C, Brooks AN. Detecting haplotype-specific transcript variation in long reads with FLAIR2. Genome Biol 2024; 25:173. [PMID: 38956576 PMCID: PMC11218413 DOI: 10.1186/s13059-024-03301-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 06/06/2024] [Indexed: 07/04/2024] Open
Abstract
BACKGROUND RNA-seq has brought forth significant discoveries regarding aberrations in RNA processing, implicating these RNA variants in a variety of diseases. Aberrant splicing and single nucleotide variants (SNVs) in RNA have been demonstrated to alter transcript stability, localization, and function. In particular, the upregulation of ADAR, an enzyme that mediates adenosine-to-inosine editing, has been previously linked to an increase in the invasiveness of lung adenocarcinoma cells and associated with splicing regulation. Despite the functional importance of studying splicing and SNVs, the use of short-read RNA-seq has limited the community's ability to interrogate both forms of RNA variation simultaneously. RESULTS We employ long-read sequencing technology to obtain full-length transcript sequences, elucidating cis-effects of variants on splicing changes at a single molecule level. We develop a computational workflow that augments FLAIR, a tool that calls isoform models expressed in long-read data, to integrate RNA variant calls with the associated isoforms that bear them. We generate nanopore data with high sequence accuracy from H1975 lung adenocarcinoma cells with and without knockdown of ADAR. We apply our workflow to identify key inosine isoform associations to help clarify the prominence of ADAR in tumorigenesis. CONCLUSIONS Ultimately, we find that a long-read approach provides valuable insight toward characterizing the relationship between RNA variants and splicing patterns.
Collapse
Affiliation(s)
- Alison D Tang
- Department of Biomolecular Engineering, University of California, Santa Cruz, USA
| | - Colette Felton
- Department of Biomolecular Engineering, University of California, Santa Cruz, USA
| | - Eva Hrabeta-Robinson
- Department of Biomolecular Engineering, University of California, Santa Cruz, USA
| | - Roger Volden
- Department of Biomolecular Engineering, University of California, Santa Cruz, USA
| | - Christopher Vollmers
- Department of Biomolecular Engineering, University of California, Santa Cruz, USA
| | - Angela N Brooks
- Department of Biomolecular Engineering, University of California, Santa Cruz, USA.
| |
Collapse
|
13
|
Pathak R, Esnault C, Radhakrishnan R, Singh PK, Zhang H, Dale R, Anand A, Bedwell GJ, Engelman AN, Rabi A, Hormoz S, Singh P, Levin HL. The role of LEDGF in transcription is exploited by HIV-1 to position integration. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.29.601340. [PMID: 39005447 PMCID: PMC11244883 DOI: 10.1101/2024.06.29.601340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
HIV-1 integration occurs across actively transcribed genes due to the interaction of integrase with host chromatin factor LEDGF. Although LEDGF was originally isolated as a co-activator that stimulates promoter activity in purified systems, this role is inconsistent with LEDGF-mediated integration across gene bodies and with data indicating LEDGF is a histone chaperone that promotes transcriptional elongation. We found LEDGF is enriched in pronounced peaks that match the enrichments of H3K4me3 and RNA Pol II at transcription start sites (TSSs) of active promoters. Our genome-wide chromatin mapping revealed that MLL1 had a dominant role in recruiting LEDGF to promoters and the presence of LEDGF recruits RNA Pol II. Enrichment of LEDGF at TSSs correlates strongly with levels of integration across the transcribed sequences, indicating that LEDGF at TSSs contributed to integration across gene bodies. Although the N-terminal Pro-Trp-Trp-Pro (PWWP) domain of LEDGF interacts with nucleosomes containing H3K36me3, a modification thought to recruit LEDGF to chromatin, we found H3K36me3 does not contribute to gene specificity of integration. These data support a dual role model of LEDGF where it is tethered to promoters by MLL1 and recruits RNA Pol II. Subsequently, LEDGF travels across genes to effect HIV-1 integration. Our data also provides a mechanistic context for the contribution made by LEDGF to MLL1-based infant acute leukemia and acute myeloid leukemia in adults.
Collapse
|
14
|
Halasz H, Malekos E, Covarrubias S, Yitiz S, Montano C, Sudek L, Katzman S, Liu SJ, Horlbeck MA, Namvar L, Weissman JS, Carpenter S. CRISPRi screens identify the lncRNA, LOUP, as a multifunctional locus regulating macrophage differentiation and inflammatory signaling. Proc Natl Acad Sci U S A 2024; 121:e2322524121. [PMID: 38781216 PMCID: PMC11145268 DOI: 10.1073/pnas.2322524121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 04/16/2024] [Indexed: 05/25/2024] Open
Abstract
Long noncoding RNAs (lncRNAs) account for the largest portion of RNA from the transcriptome, yet most of their functions remain unknown. Here, we performed two independent high-throughput CRISPRi screens to understand the role of lncRNAs in monocyte function and differentiation. The first was a reporter-based screen to identify lncRNAs that regulate TLR4-NFkB signaling in human monocytes and the second screen identified lncRNAs involved in monocyte to macrophage differentiation. We successfully identified numerous noncoding and protein-coding genes that can positively or negatively regulate inflammation and differentiation. To understand the functional roles of lncRNAs in both processes, we chose to further study the lncRNA LOUP [lncRNA originating from upstream regulatory element of SPI1 (also known as PU.1)], as it emerged as a top hit in both screens. Not only does LOUP regulate its neighboring gene, the myeloid fate-determining factor SPI1, thereby affecting monocyte to macrophage differentiation, but knockdown of LOUP leads to a broad upregulation of NFkB-targeted genes at baseline and upon TLR4-NFkB activation. LOUP also harbors three small open reading frames capable of being translated and are responsible for LOUP's ability to negatively regulate TLR4/NFkB signaling. This work emphasizes the value of high-throughput screening to rapidly identify functional lncRNAs in the innate immune system.
Collapse
Affiliation(s)
- Haley Halasz
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - Eric Malekos
- Department of Biomolecular Engineering, University of California Santa Cruz, CA95064
| | - Sergio Covarrubias
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - Samira Yitiz
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - Christy Montano
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - Lisa Sudek
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - Sol Katzman
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - S. John Liu
- Department of Radiation Oncology, University of California, San Francisco, CA94158
- Department of Neurological Surgery, University of California, San Francisco, CA94158
| | - Max A. Horlbeck
- Department of Radiation Oncology, University of California, San Francisco, CA94158
- Department of Neurological Surgery, University of California, San Francisco, CA94158
- Department of Pediatrics, Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA02115
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA02138
| | - Leila Namvar
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| | - Jonathan S. Weissman
- Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA02142
- HHMI, Chevy Chase, MD20815
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA02142
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA02142
| | - Susan Carpenter
- Department of Molecular, Cell and Developmental Biology, University of California Santa Cruz, CA95064
| |
Collapse
|
15
|
Seo J, Liu H, Young K, Zhang X, Keku TO, Jones CD, North KE, Sandler RS, Peery AF. Genetic and transcriptomic landscape of colonic diverticulosis. Gut 2024; 73:932-940. [PMID: 38443061 PMCID: PMC11088512 DOI: 10.1136/gutjnl-2023-331267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 02/15/2024] [Indexed: 03/07/2024]
Abstract
OBJECTIVE Colonic diverticulosis is a prevalent condition among older adults, marked by the presence of thin-walled pockets in the colon wall that can become inflamed, infected, haemorrhage or rupture. We present a case-control genetic and transcriptomic study aimed at identifying the genetic and cellular determinants underlying this condition and the relationship with other gastrointestinal disorders. DESIGN We conducted DNA and RNA sequencing on colonic tissue from 404 patients with (N=172) and without (N=232) diverticulosis. We investigated variation in the transcriptome associated with diverticulosis and further integrated this variation with single-cell RNA-seq data from the human intestine. We also integrated our expression quantitative trait loci with genome-wide association study using Mendelian randomisation (MR). Furthermore, a Polygenic Risk Score analysis gauged associations between diverticulosis severity and other gastrointestinal disorders. RESULTS We discerned 38 genes with differential expression and 17 with varied transcript usage linked to diverticulosis, indicating tissue remodelling as a primary diverticula formation mechanism. Diverticula formation was primarily linked to stromal and epithelial cells in the colon including endothelial cells, myofibroblasts, fibroblasts, goblet, tuft, enterocytes, neurons and glia. MR highlighted five genes including CCN3, CRISPLD2, ENTPD7, PHGR1 and TNFSF13, with potential causal effects on diverticulosis. Notably, ENTPD7 upregulation was confirmed in diverticulosis cases. Additionally, diverticulosis severity was positively correlated with genetic predisposition to diverticulitis. CONCLUSION Our results suggest that tissue remodelling is a primary mechanism for diverticula formation. Individuals with an increased genetic proclivity to diverticulitis exhibit a larger numbers of diverticula on colonoscopy.
Collapse
Affiliation(s)
- Jungkyun Seo
- Department of Epidemiology, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Hongwei Liu
- Department of Genetics, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Kristin Young
- Department of Epidemiology, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Xinruo Zhang
- Department of Epidemiology, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Temitope O Keku
- Center for Gastrointestinal Biology and Disease, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Corbin D Jones
- Department of Biology and Integrative Program for Biological and Genome Sciences, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Kari E North
- Department of Epidemiology, The University of North Carolina, Chapel Hill, North Carolina, USA
| | - Robert S Sandler
- Gastroenterology and Hepatology, The University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Anne F Peery
- Center for Gastrointestinal Biology and Disease, The University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|
16
|
Young AM, Van Buren S, Rashid NU. Differential transcript usage analysis incorporating quantification uncertainty via compositional measurement error regression modeling. Biostatistics 2024; 25:559-576. [PMID: 37040757 PMCID: PMC11017126 DOI: 10.1093/biostatistics/kxad008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 12/22/2022] [Accepted: 02/06/2023] [Indexed: 04/13/2023] Open
Abstract
Differential transcript usage (DTU) occurs when the relative expression of multiple transcripts arising from the same gene changes between different conditions. Existing approaches to detect DTU often rely on computational procedures that can have speed and scalability issues as the number of samples increases. Here we propose a new method, CompDTU, that uses compositional regression to model the relative abundance proportions of each transcript that are of interest in DTU analyses. This procedure leverages fast matrix-based computations that make it ideally suited for DTU analysis with larger sample sizes. This method also allows for the testing of and adjustment for multiple categorical or continuous covariates. Additionally, many existing approaches for DTU ignore quantification uncertainty in the expression estimates for each transcript in RNA-seq data. We extend our CompDTU method to incorporate quantification uncertainty leveraging common output from RNA-seq expression quantification tool in a novel method CompDTUme. Through several power analyses, we show that CompDTU has excellent sensitivity and reduces false positive results relative to existing methods. Additionally, CompDTUme results in further improvements in performance over CompDTU with sufficient sample size for genes with high levels of quantification uncertainty, while also maintaining favorable speed and scalability. We motivate our methods using data from the Cancer Genome Atlas Breast Invasive Carcinoma data set, specifically using RNA-seq data from primary tumors for 740 patients with breast cancer. We show greatly reduced computation time from our new methods as well as the ability to detect several novel genes with significant DTU across different breast cancer subtypes.
Collapse
Affiliation(s)
- Amber M Young
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC, 27599, USA
| | - Scott Van Buren
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC, 27599, USA
| | - Naim U Rashid
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC, 27599, USA and Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, 450 West Drive, Chapel Hill, NC, 27599, USA
| |
Collapse
|
17
|
Kilfeather P, Khoo JH, Wagner K, Liang H, Caiazza MC, An Y, Zhang X, Chen X, Connor-Robson N, Shang Z, Wade-Martins R. Single-cell spatial transcriptomic and translatomic profiling of dopaminergic neurons in health, aging, and disease. Cell Rep 2024; 43:113784. [PMID: 38386560 DOI: 10.1016/j.celrep.2024.113784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 11/14/2023] [Accepted: 01/27/2024] [Indexed: 02/24/2024] Open
Abstract
The brain is spatially organized and contains unique cell types, each performing diverse functions and exhibiting differential susceptibility to neurodegeneration. This is exemplified in Parkinson's disease with the preferential loss of dopaminergic neurons of the substantia nigra pars compacta. Using a Parkinson's transgenic model, we conducted a single-cell spatial transcriptomic and dopaminergic neuron translatomic analysis of young and old mouse brains. Through the high resolving capacity of single-cell spatial transcriptomics, we provide a deep characterization of the expression features of dopaminergic neurons and 27 other cell types within their spatial context, identifying markers of healthy and aging cells, spanning Parkinson's relevant pathways. We integrate gene enrichment and genome-wide association study data to prioritize putative causative genes for disease investigation, identifying CASR as a regulator of dopaminergic calcium handling. These datasets represent the largest public resource for the investigation of spatial gene expression in brain cells in health, aging, and disease.
Collapse
Affiliation(s)
- Peter Kilfeather
- Oxford Parkinson's Disease Centre and Department of Physiology, Anatomy and Genetics, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK; Kavli Institute for Nanoscience Discovery, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | | | - Katherina Wagner
- Oxford Parkinson's Disease Centre and Department of Physiology, Anatomy and Genetics, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK
| | | | - Maria Claudia Caiazza
- Oxford Parkinson's Disease Centre and Department of Physiology, Anatomy and Genetics, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK; Kavli Institute for Nanoscience Discovery, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA
| | - Yanru An
- BGI Research, 49276 Riga, Latvia
| | | | | | - Natalie Connor-Robson
- Oxford Parkinson's Disease Centre and Department of Physiology, Anatomy and Genetics, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK
| | | | - Richard Wade-Martins
- Oxford Parkinson's Disease Centre and Department of Physiology, Anatomy and Genetics, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK; Kavli Institute for Nanoscience Discovery, University of Oxford, Dorothy Crowfoot Hodgkin Building, South Parks Road, Oxford OX1 3QU, UK; Aligning Science Across Parkinson's (ASAP) Collaborative Research Network, Chevy Chase, MD, USA.
| |
Collapse
|
18
|
Erdogdu B, Varabyou A, Hicks SC, Salzberg SL, Pertea M. Detecting differential transcript usage in complex diseases with SPIT. CELL REPORTS METHODS 2024; 4:100736. [PMID: 38508189 PMCID: PMC10985272 DOI: 10.1016/j.crmeth.2024.100736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 12/21/2023] [Accepted: 02/27/2024] [Indexed: 03/22/2024]
Abstract
Differential transcript usage (DTU) plays a crucial role in determining how gene expression differs among cells, tissues, and developmental stages, contributing to the complexity and diversity of biological systems. In abnormal cells, it can also lead to deficiencies in protein function and underpin disease pathogenesis. Analyzing DTU via RNA sequencing (RNA-seq) data is vital, but the genetic heterogeneity in populations with complex diseases presents an intricate challenge due to diverse causal events and undetermined subtypes. Although the majority of common diseases in humans are categorized as complex, state-of-the-art DTU analysis methods often overlook this heterogeneity in their models. We therefore developed SPIT, a statistical tool that identifies predominant subgroups in transcript usage within a population along with their distinctive sets of DTU events. This study provides comprehensive assessments of SPIT's methodology and applies it to analyze brain samples from individuals with schizophrenia, revealing previously unreported DTU events in six candidate genes.
Collapse
Affiliation(s)
- Beril Erdogdu
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA.
| | - Ales Varabyou
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA; Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Stephanie C Hicks
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA; Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA; Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, MD, USA
| | - Steven L Salzberg
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA; Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA; Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA; Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Mihaela Pertea
- Center for Computational Biology, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering, Baltimore, MD, USA; Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA; Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
19
|
Liu X, Chen H, Li Z, Yang X, Jin W, Wang Y, Zheng J, Li L, Xuan C, Yuan J, Yang Y. InPACT: a computational method for accurate characterization of intronic polyadenylation from RNA sequencing data. Nat Commun 2024; 15:2583. [PMID: 38519498 PMCID: PMC10960005 DOI: 10.1038/s41467-024-46875-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 03/12/2024] [Indexed: 03/25/2024] Open
Abstract
Alternative polyadenylation can occur in introns, termed intronic polyadenylation (IPA), has been implicated in diverse biological processes and diseases, as it can produce noncoding transcripts or transcripts with truncated coding regions. However, a reliable method is required to accurately characterize IPA. Here, we propose a computational method called InPACT, which allows for the precise characterization of IPA from conventional RNA-seq data. InPACT successfully identifies numerous previously unannotated IPA transcripts in human cells, many of which are translated, as evidenced by ribosome profiling data. We have demonstrated that InPACT outperforms other methods in terms of IPA identification and quantification. Moreover, InPACT applied to monocyte activation reveals temporally coordinated IPA events. Further application on single-cell RNA-seq data of human fetal bone marrow reveals the expression of several IPA isoforms in a context-specific manner. Therefore, InPACT represents a powerful tool for the accurate characterization of IPA from RNA-seq data.
Collapse
Affiliation(s)
- Xiaochuan Liu
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Hao Chen
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Zekun Li
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Xiaoxiao Yang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Wen Jin
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Yuting Wang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Jian Zheng
- Department of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Long Li
- Department of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Chenghao Xuan
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China.
| | - Jiapei Yuan
- State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300020, China.
- Tianjin Institutes of Health Science, Tianjin, 301600, China.
| | - Yang Yang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China.
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China.
| |
Collapse
|
20
|
Jones EF, Haldar A, Oza VH, Lasseigne BN. Quantifying transcriptome diversity: a review. Brief Funct Genomics 2024; 23:83-94. [PMID: 37225889 PMCID: PMC11484519 DOI: 10.1093/bfgp/elad019] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 04/14/2023] [Accepted: 05/05/2023] [Indexed: 05/26/2023] Open
Abstract
Following the central dogma of molecular biology, gene expression heterogeneity can aid in predicting and explaining the wide variety of protein products, functions and, ultimately, heterogeneity in phenotypes. There is currently overlapping terminology used to describe the types of diversity in gene expression profiles, and overlooking these nuances can misrepresent important biological information. Here, we describe transcriptome diversity as a measure of the heterogeneity in (1) the expression of all genes within a sample or a single gene across samples in a population (gene-level diversity) or (2) the isoform-specific expression of a given gene (isoform-level diversity). We first overview modulators and quantification of transcriptome diversity at the gene level. Then, we discuss the role alternative splicing plays in driving transcript isoform-level diversity and how it can be quantified. Additionally, we overview computational resources for calculating gene-level and isoform-level diversity for high-throughput sequencing data. Finally, we discuss future applications of transcriptome diversity. This review provides a comprehensive overview of how gene expression diversity arises, and how measuring it determines a more complete picture of heterogeneity across proteins, cells, tissues, organisms and species.
Collapse
Affiliation(s)
- Emma F Jones
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Anisha Haldar
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Vishal H Oza
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Brittany N Lasseigne
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| |
Collapse
|
21
|
Seyres D, Gorka O, Schmidt R, Marone R, Zavolan M, Jeker LT. T helper cells exhibit a dynamic and reversible 3'-UTR landscape. RNA (NEW YORK, N.Y.) 2024; 30:418-434. [PMID: 38302256 PMCID: PMC10946431 DOI: 10.1261/rna.079897.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 01/16/2024] [Indexed: 02/03/2024]
Abstract
3' untranslated regions (3' UTRs) are critical elements of messenger RNAs, as they contain binding sites for RNA-binding proteins (RBPs) and microRNAs that affect various aspects of the RNA life cycle including transcript stability and cellular localization. In response to T cell receptor activation, T cells undergo massive expansion during the effector phase of the immune response and dynamically modify their 3' UTRs. Whether this serves to directly regulate the abundance of specific mRNAs or is a secondary effect of proliferation remains unclear. To study 3'-UTR dynamics in T helper cells, we investigated division-dependent alternative polyadenylation (APA). In addition, we generated 3' end UTR sequencing data from naive, activated, memory, and regulatory CD4+ T cells. 3'-UTR length changes were estimated using a nonnegative matrix factorization approach and were compared with those inferred from long-read PacBio sequencing. We found that APA events were transient and reverted after effector phase expansion. Using an orthogonal bulk RNA-seq data set, we did not find evidence of APA association with differential gene expression or transcript usage, indicating that APA has only a marginal effect on transcript abundance. 3'-UTR sequence analysis revealed conserved binding sites for T cell-relevant microRNAs and RBPs in the alternative 3' UTRs. These results indicate that poly(A) site usage could play an important role in the control of cell fate decisions and homeostasis.
Collapse
Affiliation(s)
- Denis Seyres
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| | - Oliver Gorka
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| | - Ralf Schmidt
- Computational and Systems Biology, Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Romina Marone
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| | - Mihaela Zavolan
- Computational and Systems Biology, Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Lukas T Jeker
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| |
Collapse
|
22
|
Wani SA, Hussain S, Gray JS, Nayak D, Tang H, Perez LM, Long MD, Siddappa M, McCabe CJ, Sucheston-Campbell LE, Freeman MR, Campbell MJ. Epigenetic disruption of the RARγ complex impairs its function to bookmark AR enhancer interactions required for enzalutamide sensitivity in prostate cancer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.12.15.571947. [PMID: 38168185 PMCID: PMC10760102 DOI: 10.1101/2023.12.15.571947] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
The current study in prostate cancer (PCa) focused on the genomic mechanisms at the cross-roads of pro-differentiation signals and the emergence of lineage plasticity. We explored an understudied cistromic mechanism involving RARγ's ability to govern AR cistrome-transcriptome relationships, including those associated with more aggressive PCa features. The RARγ complex in PCa cell models was enriched for canonical cofactors, as well as proteins involved in RNA processing and bookmarking. Identifying the repertoire of miR-96 bound and regulated gene targets, including those recognition elements marked by m6A, revealed their significant enrichment in the RARγ complex. RARγ significantly enhanced the AR cistrome, particularly in active enhancers and super-enhancers, and overlapped with the binding of bookmarking factors. Furthermore, RARγ expression led to nucleosome-free chromatin enriched with H3K27ac, and significantly enhanced the AR cistrome in G2/M cells. RARγ functions also antagonized the transcriptional actions of the lineage master regulator ONECUT2. Similarly, gene programs regulated by either miR-96 or antagonized by RARγ were enriched in alternative lineages and more aggressive PCa phenotypes. Together these findings reveal an under-investigated role for RARγ, modulated by miR-96, to bookmark enhancer sites during mitosis. These sites are required by the AR to promote transcriptional competence, and emphasize luminal differentiation, while antagonizing ONECUT2.
Collapse
Affiliation(s)
- Sajad A Wani
- Division of Pharmaceutics and Pharmacology, The Ohio State University, Columbus, OH 43210
| | - Shahid Hussain
- Division of Cancer Biology, Cedars Sinai Cancer, and Los Angeles, CA 90048
- Board of Governors Innovation Center, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| | - Jaimie S Gray
- Division of Pharmaceutics and Pharmacology, The Ohio State University, Columbus, OH 43210
| | - Debasis Nayak
- Division of Pharmaceutics and Pharmacology, The Ohio State University, Columbus, OH 43210
| | - Hancong Tang
- Division of Pharmaceutics and Pharmacology, The Ohio State University, Columbus, OH 43210
| | - Lillian M Perez
- Division of Cancer Therapeutics, Cedars Sinai Cancer, Departments of Urology and Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| | - Mark D Long
- Roswell Park Comprehensive Cancer Center, Elm and Carlton Streets, Buffalo, NY 14263
| | - Manjunath Siddappa
- Division of Pharmaceutics and Pharmacology, The Ohio State University, Columbus, OH 43210
| | - Christopher J McCabe
- Institute of Metabolism and Systems Research (IMSR), and Centre of Endocrinology, Diabetes and Metabolism (CEDAM), University of Birmingham, Birmingham, UK
| | | | - Michael R Freeman
- Division of Cancer Therapeutics, Cedars Sinai Cancer, Departments of Urology and Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| | - Moray J Campbell
- Division of Cancer Biology, Cedars Sinai Cancer, and Los Angeles, CA 90048
- Board of Governors Innovation Center, Department of Biomedical Sciences, Cedars-Sinai Medical Center, Los Angeles, CA 90048
| |
Collapse
|
23
|
Sananmuang T, Puthier D, Nguyen C, Chokeshaiusaha K. Differential transcript usage across mammalian oocytes at the germinal vesicle and metaphase II stages. Theriogenology 2024; 215:1-9. [PMID: 37995439 DOI: 10.1016/j.theriogenology.2023.11.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 11/11/2023] [Accepted: 11/13/2023] [Indexed: 11/25/2023]
Abstract
Ongoing progress in mRNA-Sequencing technologies has significantly contributed to the refinement of assisted reproductive technologies. However, the prior investigations have predominantly concentrated on alterations in overall gene expression levels, thereby leaving a considerable gap in our understanding of the influence of transcript isoform expression on fundamental cellular mechanisms of oocytes. Given the efficacy of differential transcript usage (DTU) analysis to address such knowledge, we conducted comprehensive DTU analysis utilizing mRNA-Seq datasets of germinal vesicle (GV) and metaphase II (MII) oocytes across six mammalian species from the SRA database, including cow, donkey, horse, human, mouse, and pig. To further illuminate the roles of these genes, we also conducted a rigorous Gene Ontology (GO) term enrichment analysis. While the DTU analysis of each species exhibited several genes with alterations in their transcript isoform usage, referred to as DTU genes, this study focused on only ten cross-species DTU genes sharing among a minimum of five distinct species (FDR≤0.05). These cross-species DTU genes were as follows: ABCF1, CDC6, CFAP36, CNOT10, DNM3, IWS1, NBN, NDEL1, RAD50 and ZCCHC17. GO term enrichment analysis unveiled the alignment of these cross-species DTU gene functions with RNA and cell-cycle control mechanisms across diverse mammalian species, thereby suggesting their vital roles during oocyte maturation. Further exploration of the transcript isoforms of these genes hence bore the potential to uncover novel transcript isoform markers for future reproductive technologies in both human and animal contexts.
Collapse
Affiliation(s)
- Thanida Sananmuang
- Rajamangala University of Technology Tawan-OK, Faculty of Veterinary Medicine, Chonburi, Thailand
| | - Denis Puthier
- Aix-Marseille Université, INSERM UMR 1090, TAGC, Marseille, France
| | - Catherine Nguyen
- Aix-Marseille Université, INSERM UMR 1090, TAGC, Marseille, France
| | - Kaj Chokeshaiusaha
- Rajamangala University of Technology Tawan-OK, Faculty of Veterinary Medicine, Chonburi, Thailand.
| |
Collapse
|
24
|
Vad OB, Angeli E, Liss M, Ahlberg G, Andreasen L, Christophersen IE, Hansen CC, Møller S, Hellsten Y, Haunsoe S, Tveit A, Svendsen JH, Gotthardt M, Lundegaard PR, Olesen MS. Loss of Cardiac Splicing Regulator RBM20 Is Associated With Early-Onset Atrial Fibrillation. JACC Basic Transl Sci 2024; 9:163-180. [PMID: 38510713 PMCID: PMC10950405 DOI: 10.1016/j.jacbts.2023.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 08/24/2023] [Accepted: 08/29/2023] [Indexed: 03/22/2024]
Abstract
We showed an association between atrial fibrillation and rare loss-of-function (LOF) variants in the cardiac splicing regulator RBM20 in 2 independent cohorts. In a rat model with loss of RBM20, we demonstrated altered splicing of sarcomere genes (NEXN, TTN, TPM1, MYOM1, and LDB3), and differential expression in key cardiac genes. We identified altered sarcomere and mitochondrial structure on electron microscopy imaging and found compromised mitochondrial function. Finally, we demonstrated that 3 novel LOF variants in RBM20, identified in patients with atrial fibrillation, lead to significantly reduced splicing activity. Our results implicate alternative splicing as a novel proarrhythmic mechanism in the atria.
Collapse
Affiliation(s)
- Oliver B. Vad
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Elisavet Angeli
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Martin Liss
- Neuromuscular and Cardiovascular Cell Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Gustav Ahlberg
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Laura Andreasen
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
| | - Ingrid E. Christophersen
- Department of Medical Research, Bærum Hospital, Vestre Viken Hospital Trust, Gjettum, Norway
- Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
| | - Camilla C. Hansen
- The August Krogh Section for Human Physiology, Department of Nutrition, Exercise and Sports, University of Copenhagen, Copenhagen, Denmark
| | - Sophie Møller
- The August Krogh Section for Human Physiology, Department of Nutrition, Exercise and Sports, University of Copenhagen, Copenhagen, Denmark
| | - Ylva Hellsten
- The August Krogh Section for Human Physiology, Department of Nutrition, Exercise and Sports, University of Copenhagen, Copenhagen, Denmark
| | - Stig Haunsoe
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
| | - Arnljot Tveit
- Department of Medical Research, Bærum Hospital, Vestre Viken Hospital Trust, Gjettum, Norway
- Institute of Clinical Medicine, Department of Cardiology, University of Oslo, Oslo, Norway
| | - Jesper H. Svendsen
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Michael Gotthardt
- Neuromuscular and Cardiovascular Cell Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- Department of Cardiology, Charité Universitätsmedizin Berlin, Berlin, Germany
- German Center for Cardiovascular Research, partner site Berlin, Berlin, Germany
| | - Pia R. Lundegaard
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Morten S. Olesen
- Department of Cardiology, Copenhagen University Hospital–Rigshospitalet, Copenhagen, Denmark
- Department of Biomedical Sciences, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
25
|
LaPierre N, Pimentel H. Accounting for isoform expression increases power to identify genetic regulation of gene expression. PLoS Comput Biol 2024; 20:e1011857. [PMID: 38346082 PMCID: PMC10890775 DOI: 10.1371/journal.pcbi.1011857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 02/23/2024] [Accepted: 01/23/2024] [Indexed: 02/25/2024] Open
Abstract
A core problem in genetics is molecular quantitative trait locus (QTL) mapping, in which genetic variants associated with changes in the molecular phenotypes are identified. One of the most-studied molecular QTL mapping problems is expression QTL (eQTL) mapping, in which the molecular phenotype is gene expression. It is common in eQTL mapping to compute gene expression by aggregating the expression levels of individual isoforms from the same gene and then performing linear regression between SNPs and this aggregated gene expression level. However, SNPs may regulate isoforms from the same gene in different directions due to alternative splicing, or only regulate the expression level of one isoform, causing this approach to lose power. Here, we examine a broader question: which genes have at least one isoform whose expression level is regulated by genetic variants? In this study, we propose and evaluate several approaches to answering this question, demonstrating that "isoform-aware" methods-those that account for the expression levels of individual isoforms-have substantially greater power to answer this question than standard "gene-level" eQTL mapping methods. We identify settings in which different approaches yield an inflated number of false discoveries or lose power. In particular, we show that calling an eGene if there is a significant association between a SNP and any isoform fails to control False Discovery Rate, even when applying standard False Discovery Rate correction. We show that similar trends are observed in real data from the GEUVADIS and GTEx studies, suggesting the possibility that similar effects are present in these consortia.
Collapse
Affiliation(s)
- Nathan LaPierre
- Department of Computer Science, University of California, Los Angeles, California, United States of America
- Department of Human Genetics, University of Chicago, Illinois, United States of America
| | - Harold Pimentel
- Department of Human Genetics, University of California, Los Angeles, California, United States of America
- Howard Hughes Medical Institute, Chevy Chase, Maryland, United States of America
- Department of Computational Medicine, University of California, Los Angeles, California, United States of America
| |
Collapse
|
26
|
Goddard TR, Brookes KJ, Sharma R, Moemeni A, Rajkumar AP. Dementia with Lewy Bodies: Genomics, Transcriptomics, and Its Future with Data Science. Cells 2024; 13:223. [PMID: 38334615 PMCID: PMC10854541 DOI: 10.3390/cells13030223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 01/17/2024] [Accepted: 01/23/2024] [Indexed: 02/10/2024] Open
Abstract
Dementia with Lewy bodies (DLB) is a significant public health issue. It is the second most common neurodegenerative dementia and presents with severe neuropsychiatric symptoms. Genomic and transcriptomic analyses have provided some insight into disease pathology. Variants within SNCA, GBA, APOE, SNCB, and MAPT have been shown to be associated with DLB in repeated genomic studies. Transcriptomic analysis, conducted predominantly on candidate genes, has identified signatures of synuclein aggregation, protein degradation, amyloid deposition, neuroinflammation, mitochondrial dysfunction, and the upregulation of heat-shock proteins in DLB. Yet, the understanding of DLB molecular pathology is incomplete. This precipitates the current clinical position whereby there are no available disease-modifying treatments or blood-based diagnostic biomarkers. Data science methods have the potential to improve disease understanding, optimising therapeutic intervention and drug development, to reduce disease burden. Genomic prediction will facilitate the early identification of cases and the timely application of future disease-modifying treatments. Transcript-level analyses across the entire transcriptome and machine learning analysis of multi-omic data will uncover novel signatures that may provide clues to DLB pathology and improve drug development. This review will discuss the current genomic and transcriptomic understanding of DLB, highlight gaps in the literature, and describe data science methods that may advance the field.
Collapse
Affiliation(s)
- Thomas R. Goddard
- Mental Health and Clinical Neurosciences Academic Unit, Institute of Mental Health, School of Medicine, University of Nottingham, Nottingham NG7 2TU, UK
| | - Keeley J. Brookes
- Department of Biosciences, School of Science & Technology, Nottingham Trent University, Nottingham NG11 8NS, UK
| | - Riddhi Sharma
- Biodiscovery Institute, School of Medicine, University of Nottingham, Nottingham NG7 2RD, UK
- UK Health Security Agency, Radiation Effects Department, Radiation Protection Science Division, Harwell Science Campus, Didcot, Oxfordshire OX11 0RQ, UK
| | - Armaghan Moemeni
- School of Computer Science, University of Nottingham, Nottingham NG8 1BB, UK
| | - Anto P. Rajkumar
- Mental Health and Clinical Neurosciences Academic Unit, Institute of Mental Health, School of Medicine, University of Nottingham, Nottingham NG7 2TU, UK
| |
Collapse
|
27
|
Kim M, Lee HH, Won SD, Jang Y, Kim BG, Cho NH, Choi YD, Chung JS, Han HH. Deciphering the Role of ERBB3 Isoforms in Renal Cell Carcinoma: A Comprehensive Genomic and Transcriptomic Analysis. MEDICINA (KAUNAS, LITHUANIA) 2024; 60:181. [PMID: 38276060 PMCID: PMC10820170 DOI: 10.3390/medicina60010181] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 01/17/2024] [Accepted: 01/19/2024] [Indexed: 01/27/2024]
Abstract
ERBB3, a key member of the receptor tyrosine kinase family, is implicated in the progression and development of various human cancers, affecting cellular proliferation and survival. This study investigated the expression of ERBB3 isoforms in renal clear cell carcinoma (RCC), utilizing data from 538 patients from The Cancer Genome Atlas (TCGA) Firehose Legacy dataset. Employing the SUPPA2 tool, the activity of 10 ERBB3 isoforms was examined, revealing distinct expression patterns in RCC. Isoforms uc001sjg.3 and uc001sjh.3 were found to have reduced activity in tumor tissues, while uc010sqb.2 and uc001sjl.3 demonstrated increased activity. These variations in isoform expression correlate with patient survival and tumor aggressiveness, indicating their complex role in RCC. The study, further, utilizes CIBERSORTx to analyze the association between ERBB3 isoforms and immune cell profiles in the tumor microenvironment. Concurrently, Gene Set Enrichment Analysis (GSEA) was applied, establishing a strong link between elevated levels of ERBB3 isoforms and critical oncogenic pathways, including DNA repair and androgen response. RT-PCR analysis targeting the exon 21-23 and exon 23 regions of ERBB3 confirmed its heightened expression in tumor tissues, underscoring the significance of alternative splicing and exon utilization in cancer development. These findings elucidate the diverse impacts of ERBB3 isoforms on RCC, suggesting their potential as diagnostic markers and therapeutic targets. This study emphasizes the need for further exploration into the specific roles of these isoforms, which could inform more personalized and effective treatment modalities for renal clear cell carcinoma.
Collapse
Affiliation(s)
- Mingyu Kim
- Center for Urologic Cancer, National Cancer Center, 323, Ilsan-Ro, Ilsandong-Gu, Goyang-Si 10408, Gyeonggi-Do, Republic of Korea; (M.K.); (H.H.L.); (S.D.W.)
| | - Hyung Ho Lee
- Center for Urologic Cancer, National Cancer Center, 323, Ilsan-Ro, Ilsandong-Gu, Goyang-Si 10408, Gyeonggi-Do, Republic of Korea; (M.K.); (H.H.L.); (S.D.W.)
| | - So Dam Won
- Center for Urologic Cancer, National Cancer Center, 323, Ilsan-Ro, Ilsandong-Gu, Goyang-Si 10408, Gyeonggi-Do, Republic of Korea; (M.K.); (H.H.L.); (S.D.W.)
| | - YeonSue Jang
- Department of Pathology, Yonsei University College of Medicine, Seoul 03722, Republic of Korea; (Y.J.); (B.G.K.)
| | - Baek Gil Kim
- Department of Pathology, Yonsei University College of Medicine, Seoul 03722, Republic of Korea; (Y.J.); (B.G.K.)
| | - Nam Hoon Cho
- Department of Pathology, Yonsei University College of Medicine, Seoul 03722, Republic of Korea; (Y.J.); (B.G.K.)
| | - Young Deuk Choi
- Department of Urology, Urological Science Institute, Yonsei University College of Medicine, 50-1, Yonsei-Ro, Seodaemun-Gu, Seoul 03722, Republic of Korea;
| | - Jin Soo Chung
- Center for Urologic Cancer, National Cancer Center, 323, Ilsan-Ro, Ilsandong-Gu, Goyang-Si 10408, Gyeonggi-Do, Republic of Korea; (M.K.); (H.H.L.); (S.D.W.)
| | - Hyun Ho Han
- Department of Urology, Urological Science Institute, Yonsei University College of Medicine, 50-1, Yonsei-Ro, Seodaemun-Gu, Seoul 03722, Republic of Korea;
| |
Collapse
|
28
|
Lio CT, Düz T, Hoffmann M, Willruth LL, Baumbach J, List M, Tsoy O. Comprehensive benchmark of differential transcript usage analysis for static and dynamic conditions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.14.575548. [PMID: 38313260 PMCID: PMC10836064 DOI: 10.1101/2024.01.14.575548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2024]
Abstract
RNA sequencing offers unique insights into transcriptome diversity, and a plethora of tools have been developed to analyze alternative splicing. One important task is to detect changes in the relative transcript abundance in differential transcript usage (DTU) analysis. The choice of the right analysis tool is non-trivial and depends on experimental factors such as the availability of single- or paired-end and bulk or single-cell data. To help users select the most promising tool for their task, we performed a comprehensive benchmark of DTU detection tools. We cover a wide array of experimental settings, using simulated bulk and single-cell RNA-seq data as well as real transcriptomics datasets, including time-series data. Our results suggest that DEXSeq, edgeR, and LimmaDS are better choices for paired-end data, while DSGseq and DEXSeq can be used for single-end data. In single-cell simulation settings, we showed that satuRn performs better than DTUrtle. In addition, we showed that Spycone is optimal for time series DTU/IS analysis based on the evidence provided using GO terms enrichment analysis.
Collapse
Affiliation(s)
- Chit Tong Lio
- Data Science in Systems Biology, Technical University of Munich, 85354 Freising, Germany
| | - Tolga Düz
- Chair of Computational Systems Biology, University of Hamburg, Notkestrasse 9, 22607 Hamburg, Germany
| | - Markus Hoffmann
- Data Science in Systems Biology, Technical University of Munich, 85354 Freising, Germany
- Institute for Advanced Study, Technical University of Munich, Garching D-85748, Germany
- National Institute of Diabetes, Digestive, and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lina-Liv Willruth
- Data Science in Systems Biology, Technical University of Munich, 85354 Freising, Germany
| | - Jan Baumbach
- Chair of Computational Systems Biology, University of Hamburg, Notkestrasse 9, 22607 Hamburg, Germany
- Institute of Mathematics and Computer Science, University of Southern Denmark, Campusvej 55, 5000 Odense, Denmark
| | - Markus List
- Data Science in Systems Biology, Technical University of Munich, 85354 Freising, Germany
| | - Olga Tsoy
- Chair of Computational Systems Biology, University of Hamburg, Notkestrasse 9, 22607 Hamburg, Germany
| |
Collapse
|
29
|
Engquist EN, Greco A, Joosten LAB, van Engelen BGM, Zammit PS, Banerji CRS. FSHD muscle shows perturbation in fibroadipogenic progenitor cells, mitochondrial function and alternative splicing independently of inflammation. Hum Mol Genet 2024; 33:182-197. [PMID: 37856562 PMCID: PMC10772042 DOI: 10.1093/hmg/ddad175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 09/25/2023] [Accepted: 10/10/2023] [Indexed: 10/21/2023] Open
Abstract
Facioscapulohumeral muscular dystrophy (FSHD) is a prevalent, incurable myopathy. FSHD is highly heterogeneous, with patients following a variety of clinical trajectories, complicating clinical trials. Skeletal muscle in FSHD undergoes fibrosis and fatty replacement that can be accelerated by inflammation, adding to heterogeneity. Well controlled molecular studies are thus essential to both categorize FSHD patients into distinct subtypes and understand pathomechanisms. Here, we further analyzed RNA-sequencing data from 24 FSHD patients, each of whom donated a biopsy from both a non-inflamed (TIRM-) and inflamed (TIRM+) muscle, and 15 FSHD patients who donated peripheral blood mononucleated cells (PBMCs), alongside non-affected control individuals. Differential gene expression analysis identified suppression of mitochondrial biogenesis and up-regulation of fibroadipogenic progenitor (FAP) gene expression in FSHD muscle, which was particularly marked on inflamed samples. PBMCs demonstrated suppression of antigen presentation in FSHD. Gene expression deconvolution revealed FAP expansion as a consistent feature of FSHD muscle, via meta-analysis of 7 independent transcriptomic datasets. Clustering of muscle biopsies separated patients in an unbiased manner into clinically mild and severe subtypes, independently of known disease modifiers (age, sex, D4Z4 repeat length). Lastly, the first genome-wide analysis of alternative splicing in FSHD muscle revealed perturbation of autophagy, BMP2 and HMGB1 signalling. Overall, our findings reveal molecular subtypes of FSHD with clinical relevance and identify novel pathomechanisms for this highly heterogeneous condition.
Collapse
Affiliation(s)
- Elise N Engquist
- Randall Centre for Cell and Molecular Biophysics, King's College London, New Hunt's House, Guy's Campus, London SE1 1UL, United Kingdom
| | - Anna Greco
- Department of Neurology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, 6525 GA, The Netherlands
- Department of Internal Medicine, Radboud Institute of Molecular Life Sciences (RIMLS) and Radboud Center of Infectious Diseases (RCI), Radboud University Medical Center, Geert Grooteplein Zuid 10, Nijmegen 6525 GA, The Netherlands
| | - Leo A B Joosten
- Department of Internal Medicine, Radboud Institute of Molecular Life Sciences (RIMLS) and Radboud Center of Infectious Diseases (RCI), Radboud University Medical Center, Geert Grooteplein Zuid 10, Nijmegen 6525 GA, The Netherlands
- Department of Medical Genetics, Iuliu Hatieganu University of Medicine and Pharmacy, 400012, Cluj-Napoca, Romania
| | - Baziel G M van Engelen
- Department of Neurology, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, 6525 GA, The Netherlands
| | - Peter S Zammit
- Randall Centre for Cell and Molecular Biophysics, King's College London, New Hunt's House, Guy's Campus, London SE1 1UL, United Kingdom
| | - Christopher R S Banerji
- Randall Centre for Cell and Molecular Biophysics, King's College London, New Hunt's House, Guy's Campus, London SE1 1UL, United Kingdom
- The Alan Turing Institute, The British Library, 96 Euston Road, London NW1 2DB, United Kingdom
| |
Collapse
|
30
|
Gray JS, Wani SA, Hussain S, Huang P, Nayak D, Long MD, Yates C, Clinton SK, Bennet CE, Coss CC, Campbell MJ. The MYC axis in advanced prostate cancer is impacted through concurrent targeting of ERβ and AR using a novel ERβ-selective ligand alongside Enzalutamide. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.15.567282. [PMID: 38014010 PMCID: PMC10680693 DOI: 10.1101/2023.11.15.567282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
We have dissected the role of Estrogen receptor beta (ERβ) in prostate cancer (PCa) with a novel ERβ ligand, OSU-ERb-12. Drug screens revealed additive interactions between OSU-ERB-12 and either epigenetic inhibitors or the androgen receptor antagonist, Enzalutamide (Enza). Clonogenic and cell biolody studies supported the potent additive effects of OSU-ERB-12 (100nM) and Enza (1μM). The cooperative behavior was in PCa cell lines treated with either OSU-ERB-12 plus Enza or combinations involving 17β-estradiol (E2). OSU-ERb-12 plus Enza uniquely impacted the transcriptiome, accessible chromatin, and the AR, MYC and H3K27ac cistromes. This included skewed transcriptional responses including suppression of the androgen and MYC transcriptomes, and repressed MYC protein. OSU-ERb-12 plus Enza uniquely impacted chromatin accessibility at approximately 3000 nucleosome-free sites, enriched at enhancers, enriched for basic Helix-Loop-Helix motifs. CUT&RUN experiments revealed combination treatment targeting of MYC, AR, and H3K27ac again shaping enhancer accessibility. Specifically, it repressed MYC interactions at enhancer regions enriched for bHLH motifs, and overlapped with publicly-available bHLH cistromes. Finally, cistrome-transcriptome analyses identified ~200 genes that distinguished advanced PCa tumors in the SU2C cohort with high androgen and low neuroendocrine scores.
Collapse
Affiliation(s)
- Jaimie S. Gray
- College of Pharmacy, Division of Pharmaceutics and Pharmacology; The Ohio State University, Columbus, OH 43210
- College of Medicine; The Ohio State University, Columbus, OH 43210
- Comprehensive Cancer Center; The Ohio State University, Columbus, OH 43210
| | - Sajad A. Wani
- College of Pharmacy, Division of Pharmaceutics and Pharmacology; The Ohio State University, Columbus, OH 43210
- Comprehensive Cancer Center; The Ohio State University, Columbus, OH 43210
| | - Shahid Hussain
- Board of Governors Innovation Center; Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA 90048
- Cedars-Sinai Cancer; Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA 90048
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA 90048
| | - Phoebe Huang
- College of Pharmacy, Division of Pharmaceutics and Pharmacology; The Ohio State University, Columbus, OH 43210
| | - Debasis Nayak
- College of Pharmacy, Division of Pharmaceutics and Pharmacology; The Ohio State University, Columbus, OH 43210
| | - Mark D. Long
- Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY 14263, USA
| | - Clayton Yates
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21287 USA
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD 21287
- Department of Oncology Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, MD 21287 USA
| | - Steven K. Clinton
- College of Medicine; The Ohio State University, Columbus, OH 43210
- Comprehensive Cancer Center; The Ohio State University, Columbus, OH 43210
| | - Chad E. Bennet
- Drug Development Institute, Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210
| | - Christopher C. Coss
- College of Pharmacy, Division of Pharmaceutics and Pharmacology; The Ohio State University, Columbus, OH 43210
- Comprehensive Cancer Center; The Ohio State University, Columbus, OH 43210
- Drug Development Institute, Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210
| | - Moray J. Campbell
- College of Pharmacy, Division of Pharmaceutics and Pharmacology; The Ohio State University, Columbus, OH 43210
- Board of Governors Innovation Center; Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA 90048
- Cedars-Sinai Cancer; Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA 90048
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, 8700 Beverly Blvd, Los Angeles, CA 90048
| |
Collapse
|
31
|
Dong X, Du MRM, Gouil Q, Tian L, Jabbari JS, Bowden R, Baldoni PL, Chen Y, Smyth GK, Amarasinghe SL, Law CW, Ritchie ME. Benchmarking long-read RNA-sequencing analysis tools using in silico mixtures. Nat Methods 2023; 20:1810-1821. [PMID: 37783886 DOI: 10.1038/s41592-023-02026-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 08/25/2023] [Indexed: 10/04/2023]
Abstract
The lack of benchmark data sets with inbuilt ground-truth makes it challenging to compare the performance of existing long-read isoform detection and differential expression analysis workflows. Here, we present a benchmark experiment using two human lung adenocarcinoma cell lines that were each profiled in triplicate together with synthetic, spliced, spike-in RNAs (sequins). Samples were deeply sequenced on both Illumina short-read and Oxford Nanopore Technologies long-read platforms. Alongside the ground-truth available via the sequins, we created in silico mixture samples to allow performance assessment in the absence of true positives or true negatives. Our results show that StringTie2 and bambu outperformed other tools from the six isoform detection tools tested, DESeq2, edgeR and limma-voom were best among the five differential transcript expression tools tested and there was no clear front-runner for performing differential transcript usage analysis between the five tools compared, which suggests further methods development is needed for this application.
Collapse
Affiliation(s)
- Xueyi Dong
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia.
| | - Mei R M Du
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
| | - Quentin Gouil
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
| | - Luyi Tian
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
- Guangzhou National Laboratory, Guangzhou, China
| | - Jafar S Jabbari
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
| | - Rory Bowden
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
| | - Pedro L Baldoni
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
| | - Yunshun Chen
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
| | - Gordon K Smyth
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- School of Mathematics and Statistics, The University of Melbourne, Parkville, Victoria, Australia
| | - Shanika L Amarasinghe
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
- The Australian Regenerative Medicine Institute, Monash University, Clayton, Victoria, Australia
| | - Charity W Law
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia
| | - Matthew E Ritchie
- The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia.
- Department of Medical Biology, The University of Melbourne, Parkville, Victoria, Australia.
| |
Collapse
|
32
|
Soulette CM, Hrabeta-Robinson E, Arevalo C, Felton C, Tang AD, Marin MG, Brooks AN. Full-length transcript alterations in human bronchial epithelial cells with U2AF1 S34F mutations. Life Sci Alliance 2023; 6:e202000641. [PMID: 37487637 PMCID: PMC10366530 DOI: 10.26508/lsa.202000641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 06/30/2023] [Accepted: 07/03/2023] [Indexed: 07/26/2023] Open
Abstract
U2AF1 is one of the most recurrently mutated splicing factors in lung adenocarcinoma and has been shown to cause transcriptome-wide pre-mRNA splicing alterations; however, the full-length altered mRNA isoforms associated with the mutation are largely unknown. To better understand the impact U2AF1 has on full-length isoform fate and function, we conducted high-throughput long-read cDNA sequencing from isogenic human bronchial epithelial cells with and without a U2AF1 S34F mutation. We identified 49,366 multi-exon transcript isoforms, more than half of which did not match GENCODE or short-read-assembled isoforms. We found 198 transcript isoforms with significant expression and usage changes relative to WT, only 68% of which were assembled by short reads. Expression of isoforms from immune-related genes is largely down-regulated in mutant cells and without observed splicing changes. Finally, we reveal that isoforms likely targeted by nonsense-mediated decay are down-regulated in U2AF1 S34F cells, suggesting that isoform changes may alter the translational output of those affected genes. Altogether, our work provides a resource of full-length isoforms associated with U2AF1 S34F in lung cells.
Collapse
Affiliation(s)
- Cameron M Soulette
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Cruz, CA, USA
| | - Eva Hrabeta-Robinson
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Carlos Arevalo
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Cruz, CA, USA
| | - Colette Felton
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Alison D Tang
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Maximillian G Marin
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Angela N Brooks
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| |
Collapse
|
33
|
Makhnovskii PA, Lednev EM, Gavrilova AO, Kurochkina NS, Vepkhvadze TF, Shestakova MV, Popov DV. Dysregulation of early gene response to a mixed meal in skeletal muscle in obesity and type 2 diabetes. Physiol Genomics 2023; 55:468-477. [PMID: 37545425 DOI: 10.1152/physiolgenomics.00046.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/10/2023] [Accepted: 07/30/2023] [Indexed: 08/08/2023] Open
Abstract
Obesity- and type 2 diabetes mellitus-induced changes in the expression of protein-coding genes in human skeletal muscle were extensively examined at baseline (after an overnight fast). We aimed to compare the early transcriptomic response to a typical single meal in skeletal muscle of metabolically healthy subjects and obese individuals without and with type 2 diabetes. Transcriptomic response (RNA-seq) to a mixed meal (nutritional drink, ∼25 kJ/kg of body mass) was examined in the vastus lateralis muscle (1 h after a meal) in 7 healthy subjects and 14 obese individuals without or with type 2 diabetes. In all obese individuals, the transcriptome response to a meal was dysregulated (suppressed and altered) and associated with different biological processes compared with healthy control. To search for potential transcription factors regulating transcriptomic response to a meal, the enrichment of transcription factor-binding sites in individual promoters of the human skeletal muscle was examined. In obese individuals, the transcriptomic response is associated with a different set of transcription factors than that in healthy subjects. In conclusion, metabolic disorders are associated with a defect in the regulation of mixed meal/insulin-mediated gene expression-insulin resistance in terms of gene expression. Importantly, this dysregulation occurs in obese individuals without type 2 diabetes, i.e., at the first stage of the development of metabolic disorders.NEW & NOTEWORTHY In skeletal muscle of metabolically healthy subjects, a typical single meal normalized to body mass induces activation of various transcription factors, expression of numerous receptor tyrosine kinases associated with the insulin signaling cascade, and transcription regulators. In skeletal muscle of obese individuals without and with type 2 diabetes, this signaling network is poorly regulated at the transcriptional level, indicating dysregulation of the early gene response to a mixed meal.
Collapse
Affiliation(s)
- Pavel A Makhnovskii
- Institute of Biomedical Problems, Russian Academy of Sciences, Moscow, Russia
| | - Egor M Lednev
- Institute of Biomedical Problems, Russian Academy of Sciences, Moscow, Russia
- Diabetes Institute, National Medical Research Centre for Endocrinology, Moscow, Russia
| | - Alina O Gavrilova
- Diabetes Institute, National Medical Research Centre for Endocrinology, Moscow, Russia
| | - Nadia S Kurochkina
- Institute of Biomedical Problems, Russian Academy of Sciences, Moscow, Russia
| | - Tatiana F Vepkhvadze
- Institute of Biomedical Problems, Russian Academy of Sciences, Moscow, Russia
- Faculty of Fundamental Medicine, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Marina V Shestakova
- Diabetes Institute, National Medical Research Centre for Endocrinology, Moscow, Russia
| | - Daniil V Popov
- Institute of Biomedical Problems, Russian Academy of Sciences, Moscow, Russia
- Faculty of Fundamental Medicine, M.V. Lomonosov Moscow State University, Moscow, Russia
| |
Collapse
|
34
|
Sneha NP, Dharshini SAP, Taguchi YH, Gromiha MM. Investigating Neuron Degeneration in Huntington's Disease Using RNA-Seq Based Transcriptome Study. Genes (Basel) 2023; 14:1801. [PMID: 37761940 PMCID: PMC10530489 DOI: 10.3390/genes14091801] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 09/02/2023] [Accepted: 09/11/2023] [Indexed: 09/29/2023] Open
Abstract
Huntington's disease (HD) is a progressive neurodegenerative disorder caused due to a CAG repeat expansion in the huntingtin (HTT) gene. The primary symptoms of HD include motor dysfunction such as chorea, dystonia, and involuntary movements. The primary motor cortex (BA4) is the key brain region responsible for executing motor/movement activities. Investigating patient and control samples from the BA4 region will provide a deeper understanding of the genes responsible for neuron degeneration and help to identify potential markers. Previous studies have focused on overall differential gene expression and associated biological functions. In this study, we illustrate the relationship between variants and differentially expressed genes/transcripts. We identified variants and their associated genes along with the quantification of genes and transcripts. We also predicted the effect of variants on various regulatory activities and found that many variants are regulating gene expression. Variants affecting miRNA and its targets are also highlighted in our study. Co-expression network studies revealed the role of novel genes. Function interaction network analysis unveiled the importance of genes involved in vesicle-mediated transport. From this unified approach, we propose that genes expressed in immune cells are crucial for reducing neuron death in HD.
Collapse
Affiliation(s)
- Nela Pragathi Sneha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; (N.P.S.); (S.A.P.D.)
| | - S. Akila Parvathy Dharshini
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; (N.P.S.); (S.A.P.D.)
| | - Y.-h. Taguchi
- Department of Physics, Chuo University, Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan;
| | - M. Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; (N.P.S.); (S.A.P.D.)
| |
Collapse
|
35
|
Cortés-López M, Chamely P, Hawkins AG, Stanley RF, Swett AD, Ganesan S, Mouhieddine TH, Dai X, Kluegel L, Chen C, Batta K, Furer N, Vedula RS, Beaulaurier J, Drong AW, Hickey S, Dusaj N, Mullokandov G, Stasiw AM, Su J, Chaligné R, Juul S, Harrington E, Knowles DA, Potenski CJ, Wiseman DH, Tanay A, Shlush L, Lindsley RC, Ghobrial IM, Taylor J, Abdel-Wahab O, Gaiti F, Landau DA. Single-cell multi-omics defines the cell-type-specific impact of splicing aberrations in human hematopoietic clonal outgrowths. Cell Stem Cell 2023; 30:1262-1281.e8. [PMID: 37582363 PMCID: PMC10528176 DOI: 10.1016/j.stem.2023.07.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 05/28/2023] [Accepted: 07/18/2023] [Indexed: 08/17/2023]
Abstract
RNA splicing factors are recurrently mutated in clonal blood disorders, but the impact of dysregulated splicing in hematopoiesis remains unclear. To overcome technical limitations, we integrated genotyping of transcriptomes (GoT) with long-read single-cell transcriptomics and proteogenomics for single-cell profiling of transcriptomes, surface proteins, somatic mutations, and RNA splicing (GoT-Splice). We applied GoT-Splice to hematopoietic progenitors from myelodysplastic syndrome (MDS) patients with mutations in the core splicing factor SF3B1. SF3B1mut cells were enriched in the megakaryocytic-erythroid lineage, with expansion of SF3B1mut erythroid progenitor cells. We uncovered distinct cryptic 3' splice site usage in different progenitor populations and stage-specific aberrant splicing during erythroid differentiation. Profiling SF3B1-mutated clonal hematopoiesis samples revealed that erythroid bias and cell-type-specific cryptic 3' splice site usage in SF3B1mut cells precede overt MDS. Collectively, GoT-Splice defines the cell-type-specific impact of somatic mutations on RNA splicing, from early clonal outgrowths to overt neoplasia, directly in human samples.
Collapse
Affiliation(s)
- Mariela Cortés-López
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Paulina Chamely
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Allegra G Hawkins
- Childhood Cancer Data Lab, Alex's Lemonade Stand Foundation, Philadelphia, PA, USA
| | - Robert F Stanley
- Molecular Pharmacology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ariel D Swett
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Saravanan Ganesan
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Tarek H Mouhieddine
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Xiaoguang Dai
- Oxford Nanopore Technologies Inc., New York, NY, USA
| | - Lloyd Kluegel
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Celine Chen
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Tri-Institutional MD-PhD Program, Weill Cornell Medicine, Rockefeller University, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Kiran Batta
- Division of Cancer Sciences, The University of Manchester, Manchester, UK
| | - Nili Furer
- Weizmann Institute of Science, Department of Molecular Cell Biology, Rehovot, Israel
| | - Rahul S Vedula
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | | | | | - Scott Hickey
- Oxford Nanopore Technologies Inc., San Francisco, CA, USA
| | - Neville Dusaj
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Tri-Institutional MD-PhD Program, Weill Cornell Medicine, Rockefeller University, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Gavriel Mullokandov
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Adam M Stasiw
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Jiayu Su
- New York Genome Center, New York, NY, USA; Department of Systems Biology, Columbia University, New York, NY, USA
| | - Ronan Chaligné
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Sissel Juul
- Oxford Nanopore Technologies Inc., New York, NY, USA
| | | | - David A Knowles
- New York Genome Center, New York, NY, USA; Department of Systems Biology, Columbia University, New York, NY, USA; Department of Computer Science, Columbia University, New York, NY, USA
| | - Catherine J Potenski
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Daniel H Wiseman
- Division of Cancer Sciences, The University of Manchester, Manchester, UK
| | - Amos Tanay
- Weizmann Institute of Science, Department of Computer Science and Applied Mathematics, Rehovot, Israel
| | - Liran Shlush
- Weizmann Institute of Science, Department of Molecular Cell Biology, Rehovot, Israel
| | - Robert C Lindsley
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Irene M Ghobrial
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Justin Taylor
- Sylvester Comprehensive Cancer Center, University of Miami, Miller School of Medicine, Miami, FL, USA
| | - Omar Abdel-Wahab
- Molecular Pharmacology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Federico Gaiti
- University Health Network, Princess Margaret Cancer Centre, Toronto, ON, Canada; University of Toronto, Medical Biophysics, Toronto, ON, Canada.
| | - Dan A Landau
- New York Genome Center, New York, NY, USA; Division of Hematology and Medical Oncology, Department of Medicine and Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA; Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
36
|
Abril-Pla O, Andreani V, Carroll C, Dong L, Fonnesbeck CJ, Kochurov M, Kumar R, Lao J, Luhmann CC, Martin OA, Osthege M, Vieira R, Wiecki T, Zinkov R. PyMC: a modern, and comprehensive probabilistic programming framework in Python. PeerJ Comput Sci 2023; 9:e1516. [PMID: 37705656 PMCID: PMC10495961 DOI: 10.7717/peerj-cs.1516] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 07/13/2023] [Indexed: 09/15/2023]
Abstract
PyMC is a probabilistic programming library for Python that provides tools for constructing and fitting Bayesian models. It offers an intuitive, readable syntax that is close to the natural syntax statisticians use to describe models. PyMC leverages the symbolic computation library PyTensor, allowing it to be compiled into a variety of computational backends, such as C, JAX, and Numba, which in turn offer access to different computational architectures including CPU, GPU, and TPU. Being a general modeling framework, PyMC supports a variety of models including generalized hierarchical linear regression and classification, time series, ordinary differential equations (ODEs), and non-parametric models such as Gaussian processes (GPs). We demonstrate PyMC's versatility and ease of use with examples spanning a range of common statistical models. Additionally, we discuss the positive role of PyMC in the development of the open-source ecosystem for probabilistic programming.
Collapse
Affiliation(s)
| | - Virgile Andreani
- Biomedical Engineering Department, Boston University, Boston, United States of America
- Biological Design Center, Boston University, Boston, United States of America
| | | | - Larry Dong
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Child Health Evaluative Sciences, The Hospital for Sick Children, Toronto, Canada
| | - Christopher J. Fonnesbeck
- Baseball Operations Research and Development, Philadelphia Phillies, Philadelphia, United States of America
| | | | - Ravin Kumar
- Google, Mountain View, CA, United States of America
| | | | - Christian C. Luhmann
- Department of Psychology, Stony Brook University, Stony Brook, United States of America
- Institute for Advanced Computational Science, Stony Brook University, Stony Brook NY, United States of America
| | - Osvaldo A. Martin
- IMASL-CONICET, Universidad Nacional de San Luis, San Luis, Argentina
| | | | | | | | | |
Collapse
|
37
|
Engelhard CA, Khani S, Derdak S, Bilban M, Kornfeld JW. Nanopore sequencing unveils the complexity of the cold-activated murine brown adipose tissue transcriptome. iScience 2023; 26:107190. [PMID: 37564700 PMCID: PMC10410515 DOI: 10.1016/j.isci.2023.107190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/28/2023] [Accepted: 06/16/2023] [Indexed: 08/12/2023] Open
Abstract
Alternative transcription increases transcriptome complexity by expression of multiple transcripts per gene. Annotation and quantification of transcripts using short-read sequencing is non-trivial. Long-read sequencing aims at overcoming these problems by sequencing full-length transcripts. Activation of brown adipose tissue (BAT) thermogenesis involves major transcriptomic remodeling and positively affects metabolism via increased energy expenditure. We benchmark Oxford Nanopore Technology (ONT) long-read sequencing protocols to Illumina short-read sequencing assessing alignment characteristics, gene and transcript detection and quantification, differential gene and transcript expression, transcriptome reannotation, and differential transcript usage (DTU). We find ONT sequencing is superior to Illumina for transcriptome reassembly, reducing the risk of false-positive events by unambiguously mapping reads to transcripts. We identified novel isoforms of genes undergoing DTU in cold-activated BAT including Cars2, Adtrp, Acsl5, Scp2, Aldoa, and Pde4d, validated by real-time PCR. The reannotated murine BAT transcriptome established here provides a framework for future investigations into the regulation of BAT.
Collapse
Affiliation(s)
- Christoph Andreas Engelhard
- Department for Biochemistry and Molecular Biology (BMB), University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| | - Sajjad Khani
- Max Planck Institute for Metabolism Research, Gleueler Strasse 50, 50931 Cologne, Germany
- Cologne Excellence Cluster on Cellular Stress Responses in Ageing-Associated Diseases (CECAD), University of Cologne, Cologne, Germany
| | - Sophia Derdak
- Core Facilities, Medical University of Vienna, Lazarettgasse 14, 1090 Vienna, Austria
| | - Martin Bilban
- Department of Laboratory Medicine & Core Facilities, Medical University of Vienna, Waehringer Guertel 18-20, 1090 Vienna, Austria
| | - Jan-Wilhelm Kornfeld
- Department for Biochemistry and Molecular Biology (BMB), University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| |
Collapse
|
38
|
Wierczeiko A, Pastore S, Mündnich S, Busch AM, Dietrich V, Helm M, Butto T, Gerber S. NanopoReaTA: a user-friendly tool for nanopore-seq real-time transcriptional analysis. Bioinformatics 2023; 39:btad492. [PMID: 37549052 PMCID: PMC10435370 DOI: 10.1093/bioinformatics/btad492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 07/12/2023] [Accepted: 08/05/2023] [Indexed: 08/09/2023] Open
Abstract
SUMMARY Oxford Nanopore Technologies' (ONT) sequencing platform offers an excellent opportunity to perform real-time analysis during sequencing. This feature allows for early insights into experimental data and accelerates a potential decision-making process for further analysis, which can be particularly relevant in the clinical context. Although some tools for the real-time analysis of DNA-sequencing data already exist, there is currently no application available for differential transcriptome data analysis designed for scientists or physicians with limited bioinformatics knowledge. Here, we introduce NanopoReaTA, a user-friendly real-time analysis toolbox for RNA-sequencing data from ONT. Sequencing results from a running or finished experiment are processed through an R Shiny-based graphical user interface with an integrated Nextflow pipeline for whole transcriptome or gene-specific analyses. NanopoReaTA provides visual snapshots of a sequencing run in progress, thus enabling interactive sequencing and rapid decision making that could also be applied to clinical cases. AVAILABILITY AND IMPLEMENTATION Github https://github.com/AnWiercze/NanopoReaTA; Zenodo https://doi.org/10.5281/zenodo.8099825.
Collapse
Affiliation(s)
- Anna Wierczeiko
- Institute of Human Genetics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz 55131, Germany
| | - Stefan Pastore
- Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg-University Mainz, Mainz 55128, Germany
| | - Stefan Mündnich
- Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg-University Mainz, Mainz 55128, Germany
| | - Anne M Busch
- Institute of Human Genetics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz 55131, Germany
| | - Vincent Dietrich
- Institute of Human Genetics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz 55131, Germany
| | - Mark Helm
- Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg-University Mainz, Mainz 55128, Germany
| | - Tamer Butto
- Institute of Pharmaceutical and Biomedical Sciences, Johannes Gutenberg-University Mainz, Mainz 55128, Germany
| | - Susanne Gerber
- Institute of Human Genetics, University Medical Center of the Johannes Gutenberg University Mainz, Mainz 55131, Germany
| |
Collapse
|
39
|
Erdogdu B, Varabyou A, Hicks SC, Salzberg SL, Pertea M. Detecting differential transcript usage in complex diseases with SPIT. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.10.548289. [PMID: 37503064 PMCID: PMC10369883 DOI: 10.1101/2023.07.10.548289] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Differential transcript usage (DTU) plays a crucial role in determining how gene expression differs among cells, tissues, and different developmental stages, thereby contributing to the complexity and diversity of biological systems. In abnormal cells, it can also lead to deficiencies in protein function, potentially leading to pathogenesis of diseases. Detecting such events for single-gene genetic traits is relatively uncomplicated; however, the heterogeneity of populations with complex diseases presents an intricate challenge due to the presence of diverse causal events and undetermined subtypes. SPIT is the first statistical tool that quantifies the heterogeneity in transcript usage within a population and identifies predominant subgroups along with their distinctive sets of DTU events. We provide comprehensive assessments of SPIT's methodology in both single-gene and complex traits and report the results of applying SPIT to analyze brain samples from individuals with schizophrenia. Our analysis reveals previously unreported DTU events in six candidate genes.
Collapse
Affiliation(s)
- Beril Erdogdu
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD, United States
- Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering; Baltimore, MD, United States
| | - Ales Varabyou
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD, United States
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, United States
| | - Stephanie C Hicks
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD, United States
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, MD, USA
- Malone Center for Engineering in Healthcare, Johns Hopkins University, MD, USA
| | - Steven L Salzberg
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD, United States
- Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering; Baltimore, MD, United States
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, United States
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, MD, USA
- Department of Genetic Medicine, Johns Hopkins School of Medicine; Baltimore, MD, United States
| | - Mihaela Pertea
- Center for Computational Biology, Johns Hopkins University; Baltimore, MD, United States
- Department of Biomedical Engineering, Johns Hopkins School of Medicine and Whiting School of Engineering; Baltimore, MD, United States
- Department of Computer Science, Johns Hopkins University; Baltimore, MD, United States
- Department of Genetic Medicine, Johns Hopkins School of Medicine; Baltimore, MD, United States
| |
Collapse
|
40
|
Dolgalev G, Poverennaya E. Quantitative Analysis of Isoform Switching in Cancer. Int J Mol Sci 2023; 24:10065. [PMID: 37373214 DOI: 10.3390/ijms241210065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 05/26/2023] [Accepted: 06/11/2023] [Indexed: 06/29/2023] Open
Abstract
Over the past 8 years, multiple studies examined the phenomenon of isoform switching in human cancers and discovered that isoform switching is widespread, with hundreds to thousands of such events per cancer type. Although all of these studies used slightly different definitions of isoform switching, which in part led to a rather poor overlap of their results, they all leveraged transcript usage, a proportion of the transcript's expression in the total expression level of the parent gene, to detect isoform switching. However, how changes in transcript usage correlate with changes in transcript expression is not sufficiently explored. In this article, we adopt the most common definition of isoform switching and use a state-of-the-art tool for the analysis of differential transcript usage, SatuRn, to detect isoform switching events in 12 cancer types. We analyze the detected events in terms of changes in transcript usage and the relationship between transcript usage and transcript expression on a global scale. The results of our analysis suggest that the relationship between changes in transcript usage and changes in transcript expression is far from straightforward, and that such quantitative information can be effectively used for prioritizing isoform switching events for downstream analyses.
Collapse
|
41
|
Tang AD, Hrabeta-Robinson E, Volden R, Vollmers C, Brooks AN. Detecting haplotype-specific transcript variation in long reads with FLAIR2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.09.544396. [PMID: 37398362 PMCID: PMC10312636 DOI: 10.1101/2023.06.09.544396] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Background RNA-Seq has brought forth significant discoveries regarding aberrations in RNA processing, implicating these RNA variants in a variety of diseases. Aberrant splicing and single nucleotide variants in RNA have been demonstrated to alter transcript stability, localization, and function. In particular, the upregulation of ADAR, an enzyme which mediates adenosine-to-inosine editing, has been previously linked to an increase in the invasiveness of lung ADC cells and associated with splicing regulation. Despite the functional importance of studying splicing and SNVs, short read RNA-Seq has limited the community's ability to interrogate both forms of RNA variation simultaneously. Results We employed long-read technology to obtain full-length transcript sequences, elucidating cis-effects of variants on splicing changes at a single molecule level. We have developed a computational workflow that augments FLAIR, a tool that calls isoform models expressed in long-read data, to integrate RNA variant calls with the associated isoforms that bear them. We generated nanopore data with high sequence accuracy of H1975 lung adenocarcinoma cells with and without knockdown of ADAR. We applied our workflow to identify key inosine-isoform associations to help clarify the prominence of ADAR in tumorigenesis. Conclusions Ultimately, we find that a long-read approach provides valuable insight toward characterizing the relationship between RNA variants and splicing patterns.
Collapse
Affiliation(s)
- Alison D. Tang
- Department of Biomolecular Engineering, University of California, Santa Cruz
| | | | - Roger Volden
- Department of Biomolecular Engineering, University of California, Santa Cruz
| | | | - Angela N. Brooks
- Department of Biomolecular Engineering, University of California, Santa Cruz
| |
Collapse
|
42
|
Leung CS, Rosenzweig SJ, Yoon B, Marinelli NA, Hollingsworth EW, Maguire AM, Cowen MH, Schmidt M, Imitola J, Gamsiz Uzun ED, Lizarraga SB. Dysregulation of the chromatin environment leads to differential alternative splicing as a mechanism of disease in a human model of autism spectrum disorder. Hum Mol Genet 2023; 32:1634-1646. [PMID: 36621967 PMCID: PMC10162432 DOI: 10.1093/hmg/ddad002] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/24/2022] [Accepted: 01/03/2023] [Indexed: 01/10/2023] Open
Abstract
Autism spectrum disorder (ASD) affects 1 in 44 children. Chromatin regulatory proteins are overrepresented among genes that contain high risk variants in ASD. Disruption of the chromatin environment leads to widespread dysregulation of gene expression, which is traditionally thought of as a mechanism of disease pathogenesis associated with ASD. Alternatively, alterations in chromatin dynamics could also lead to dysregulation of alternative splicing, which is understudied as a mechanism of ASD pathogenesis. The anticonvulsant valproic acid (VPA) is a well-known environmental risk factor for ASD that acts as a class I histone deacetylase inhibitor. However, the precise molecular mechanisms underlying defects in human neuronal development associated with exposure to VPA are understudied. To dissect how VPA exposure and subsequent chromatin hyperacetylation influence molecular signatures involved in ASD pathogenesis, we conducted RNA sequencing (RNA-seq) in human cortical neurons that were treated with VPA. We observed that differentially expressed genes (DEGs) were enriched for mRNA splicing, mRNA processing, histone modification and metabolism related gene sets. Furthermore, we observed widespread increases in the number and the type of alternative splicing events. Analysis of differential transcript usage (DTU) showed that exposure to VPA induces extensive alterations in transcript isoform usage across neurodevelopmentally important genes. Finally, we find that DEGs and genes that display DTU overlap with known ASD-risk genes. Altogether, these findings suggest that, in addition to differential gene expression, changes in alternative splicing correlated with alterations in the chromatin environment could act as an additional mechanism of disease in ASD.
Collapse
Affiliation(s)
- Calvin S Leung
- Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI 02912, USA
- Center for Translational Neuroscience, Carney Institute for Brain Science and Brown Institute for Translational Science (BITS), Brown University, Providence, RI 02912, USA
| | - Shoshana J Rosenzweig
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA
- Department of Pathology and Laboratory Medicine, Warren Alpert Medical School of Brown University, Providence, RI 02912, USA
- Department of Pathology and Laboratory Medicine, Rhode Island Hospital and Lifespan Academic Medical Center, Providence, RI 02903, USA
| | - Brian Yoon
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | - Nicholas A Marinelli
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | - Ethan W Hollingsworth
- UCONN Health Comprehensive Multiple Sclerosis Center, Department of Neurology, University of Connecticut School of Medicine, Farmington, CT 06030, USA
- Division of Multiple Sclerosis and Translational Neuroimmunology, Department of Neurology, University of Connecticut School of Medicine, Farmington, CT 06030, USA
| | - Abbie M Maguire
- Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI 02912, USA
- Center for Translational Neuroscience, Carney Institute for Brain Science and Brown Institute for Translational Science (BITS), Brown University, Providence, RI 02912, USA
| | - Mara H Cowen
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA
| | - Michael Schmidt
- Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI 02912, USA
- Center for Translational Neuroscience, Carney Institute for Brain Science and Brown Institute for Translational Science (BITS), Brown University, Providence, RI 02912, USA
| | - Jaime Imitola
- UCONN Health Comprehensive Multiple Sclerosis Center, Department of Neurology, University of Connecticut School of Medicine, Farmington, CT 06030, USA
- Division of Multiple Sclerosis and Translational Neuroimmunology, Department of Neurology, University of Connecticut School of Medicine, Farmington, CT 06030, USA
| | - Ece D Gamsiz Uzun
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA
- Department of Pathology and Laboratory Medicine, Warren Alpert Medical School of Brown University, Providence, RI 02912, USA
- Department of Pathology and Laboratory Medicine, Rhode Island Hospital and Lifespan Academic Medical Center, Providence, RI 02903, USA
| | - Sofia B Lizarraga
- Department of Molecular Biology, Cell Biology and Biochemistry, Brown University, Providence, RI 02912, USA
- Center for Translational Neuroscience, Carney Institute for Brain Science and Brown Institute for Translational Science (BITS), Brown University, Providence, RI 02912, USA
| |
Collapse
|
43
|
Salih MM, Robinson EK, Malekos E, Perez E, Capili A, Kim K, Zhang WZ, Cloonan SM, Carpenter S. LincRNA-Cox2 Regulates Smoke-induced Inflammation in Murine Macrophages. Am J Respir Cell Mol Biol 2023; 68:511-522. [PMID: 36657060 DOI: 10.1165/rcmb.2022-0413oc] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Accepted: 01/19/2023] [Indexed: 01/20/2023] Open
Abstract
Cigarette smoke (CS) exposure is a risk factor for many chronic diseases, including chronic obstructive pulmonary disease, but the mechanism by which smoke exposure can alter homeostasis and bring about chronic inflammation is poorly understood. Here, we showcase a novel role for smoke in regulating long noncoding RNAs, showing that it activates lincRNA-Cox2, which we previously characterized as functional in inflammatory regulation. Exposing lincRNA-Cox2 murine models to smoke in vivo confirmed lincRNA-Cox2 as a regulator of inflammatory gene expression in response to smoke both systemically and within the lung. We also report that lincRNA-Cox2 negatively regulates genes in smoked bone marrow-derived macrophages exposed to LPS stimulation. In addition to the effects on long noncoding RNAs, we also report dysregulated transcription and splicing of inflammatory protein-coding genes in the bone marrow niche after CS exposure in vivo. Collectively, this work provides insights into how innate immune signaling from gene expression to splicing is altered after in vivo exposure to CS and highlights an important new role for lincRNA-Cox2 in regulating immune genes after smoke exposure.
Collapse
Affiliation(s)
- Mays Mohammed Salih
- Department of Molecular, Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, California
| | - Elektra Kantzari Robinson
- Department of Molecular, Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, California
| | - Eric Malekos
- Department of Molecular, Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, California
| | - Elizabeth Perez
- Division of Pulmonary and Critical Care Medicine, Joan and Sanford I. Weill Department of Medicine, Weill Cornell Medicine, New York, New York; and
| | - Allyson Capili
- Division of Pulmonary and Critical Care Medicine, Joan and Sanford I. Weill Department of Medicine, Weill Cornell Medicine, New York, New York; and
| | - Kihwan Kim
- Division of Pulmonary and Critical Care Medicine, Joan and Sanford I. Weill Department of Medicine, Weill Cornell Medicine, New York, New York; and
| | - William Z Zhang
- Division of Pulmonary and Critical Care Medicine, Joan and Sanford I. Weill Department of Medicine, Weill Cornell Medicine, New York, New York; and
| | - Suzanne M Cloonan
- Division of Pulmonary and Critical Care Medicine, Joan and Sanford I. Weill Department of Medicine, Weill Cornell Medicine, New York, New York; and
- School of Medicine, Trinity Biomedical Sciences Institute, Trinity College, Dublin, Ireland
| | - Susan Carpenter
- Department of Molecular, Cell and Developmental Biology, University of California, Santa Cruz, Santa Cruz, California
| |
Collapse
|
44
|
McCabe SD, Nobel AB, Love MI. ACTOR: a latent Dirichlet model to compare expressed isoform proportions to a reference panel. Biostatistics 2023; 24:388-405. [PMID: 33948626 PMCID: PMC10102900 DOI: 10.1093/biostatistics/kxab013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 03/19/2021] [Accepted: 03/23/2021] [Indexed: 11/13/2022] Open
Abstract
The relative proportion of RNA isoforms expressed for a given gene has been associated with disease states in cancer, retinal diseases, and neurological disorders. Examination of relative isoform proportions can help determine biological mechanisms, but such analyses often require a per-gene investigation of splicing patterns. Leveraging large public data sets produced by genomic consortia as a reference, one can compare splicing patterns in a data set of interest with those of a reference panel in which samples are divided into distinct groups, such as tissue of origin, or disease status. We propose A latent Dirichlet model to Compare expressed isoform proportions TO a Reference panel (ACTOR), a latent Dirichlet model with Dirichlet Multinomial observations to compare expressed isoform proportions in a data set to an independent reference panel. We use a variational Bayes procedure to estimate posterior distributions for the group membership of one or more samples. Using the Genotype-Tissue Expression project as a reference data set, we evaluate ACTOR on simulated and real RNA-seq data sets to determine tissue-type classifications of genes. ACTOR is publicly available as an R package at https://github.com/mccabes292/actor.
Collapse
Affiliation(s)
- Sean D McCabe
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599-7400, USA
| | - Andrew B Nobel
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, 318 Hanes Hall, Chapel Hill, NC 27599-3260, USA and Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599-7400, USA
| | - Michael I Love
- Department of Biostatistics, University of North Carolina at Chapel Hill, 135 Dauer Drive, Chapel Hill, NC 27599-7400, USA and Department of Genetics, University of North Carolina at Chapel Hill, 120 Mason Farm Rd, Chapel Hill, NC 27514, USA
| |
Collapse
|
45
|
Siddappa M, Hussain S, Wani SA, White J, Tang H, Gray JS, Jafari H, Wu HC, Long MD, Elhussin I, Karanam B, Wang H, Morgan R, Hardiman G, Adelani IB, Rotimi SO, Murphy AR, Nonn L, Davis MB, Kittles RA, Hughes Halbert C, Sucheston-Campbell LE, Yates C, Campbell MJ. African American Prostate Cancer Displays Quantitatively Distinct Vitamin D Receptor Cistrome-transcriptome Relationships Regulated by BAZ1A. CANCER RESEARCH COMMUNICATIONS 2023; 3:621-639. [PMID: 37082578 PMCID: PMC10112383 DOI: 10.1158/2767-9764.crc-22-0389] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 01/12/2023] [Accepted: 03/07/2023] [Indexed: 04/22/2023]
Abstract
African American (AA) prostate cancer associates with vitamin D3 deficiency, but vitamin D receptor (VDR) genomic actions have not been investigated in this context. We undertook VDR proteogenomic analyses in European American (EA) and AA prostate cell lines and four clinical cohorts. Rapid immunoprecipitation mass spectrometry of endogenous protein (RIME) analyses revealed that nonmalignant AA RC43N prostate cells displayed the greatest dynamic protein content in the VDR complex. Likewise, in AA cells, Assay for Transposase-Accessible Chromatin using sequencing established greater 1α,25(OH)2D3-regulated chromatin accessibility, chromatin immunoprecipitation sequencing revealed significant enhancer-enriched VDR cistrome, and RNA sequencing identified the largest 1α,25(OH)2D3-dependent transcriptome. These VDR functions were significantly corrupted in the isogenic AA RC43T prostate cancer cells, and significantly distinct from EA cell models. We identified reduced expression of the chromatin remodeler, BAZ1A, in three AA prostate cancer cohorts as well as RC43T compared with RC43N. Restored BAZ1A expression significantly increased 1α,25(OH)2D3-regulated VDR-dependent gene expression in RC43T, but not HPr1AR or LNCaP cells. The clinical impact of VDR cistrome-transcriptome relationships were tested in three different clinical prostate cancer cohorts. Strikingly, only in AA patients with prostate cancer, the genes bound by VDR and/or associated with 1α,25(OH)2D3-dependent open chromatin (i) predicted progression from high-grade prostatic intraepithelial neoplasia to prostate cancer; (ii) responded to vitamin D3 supplementation in prostate cancer tumors; (iii) differentially responded to 25(OH)D3 serum levels. Finally, partial correlation analyses established that BAZ1A and components of the VDR complex identified by RIME significantly strengthened the correlation between VDR and target genes in AA prostate cancer only. Therefore, VDR transcriptional control is most potent in AA prostate cells and distorted through a BAZ1A-dependent control of VDR function. Significance Our study identified that genomic ancestry drives the VDR complex composition, genomic distribution, and transcriptional function, and is disrupted by BAZ1A and illustrates a novel driver for AA prostate cancer.
Collapse
Affiliation(s)
- Manjunath Siddappa
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Shahid Hussain
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Sajad A. Wani
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Jason White
- Department of Biology and Center for Cancer Research, Tuskegee University, Tuskegee, Alabama
| | - Hancong Tang
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Jaimie S. Gray
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Hedieh Jafari
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Hsu-Chang Wu
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| | - Mark D. Long
- Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, New York
| | - Isra Elhussin
- Department of Biology and Center for Cancer Research, Tuskegee University, Tuskegee, Alabama
| | - Balasubramanyam Karanam
- Department of Biology and Center for Cancer Research, Tuskegee University, Tuskegee, Alabama
| | - Honghe Wang
- Department of Biology and Center for Cancer Research, Tuskegee University, Tuskegee, Alabama
| | - Rebecca Morgan
- School of Biological Sciences, Institute for Global Food Security, Queen's University Belfast, Belfast, United Kingdom
| | - Gary Hardiman
- School of Biological Sciences, Institute for Global Food Security, Queen's University Belfast, Belfast, United Kingdom
- Department of Medicine, Medical University of South Carolina, Charleston, South Carolina
| | | | - Solomon O. Rotimi
- Department of Biochemistry, Covenant University, Ota, Ogun State, Nigeria
| | - Adam R. Murphy
- Department of Urology, Northwestern Medicine, Chicago, Illinois
| | - Larisa Nonn
- Department of Pathology, University of Illinois at Chicago, Chicago, Illinois
| | - Melissa B. Davis
- Department of Surgery, Weill Cornell Medicine, New York City, New York
| | - Rick A. Kittles
- Division of Health Equities, Department of Population Sciences, City of Hope, Duarte, California
| | - Chanita Hughes Halbert
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California
- Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, California
| | - Lara E. Sucheston-Campbell
- Division of Pharmacy Practice and Science, College of Pharmacy, The Ohio State University, Columbus, Ohio
- Department of Veterinary Biosciences, College of Veterinary Medicine, The Ohio State University, Columbus, Ohio
| | - Clayton Yates
- Department of Biology and Center for Cancer Research, Tuskegee University, Tuskegee, Alabama
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Department of Oncology Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, Maryland
| | - Moray J. Campbell
- Pharmaceutics and Pharmaceutical Chemistry, College of Pharmacy, The Ohio State University, Columbus, Ohio
| |
Collapse
|
46
|
Ren Y, Tseng E, Smith TPL, Hiendleder S, Williams JL, Low WY. Long read isoform sequencing reveals hidden transcriptional complexity between cattle subspecies. BMC Genomics 2023; 24:108. [PMID: 36915055 PMCID: PMC10012480 DOI: 10.1186/s12864-023-09212-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 02/27/2023] [Indexed: 03/16/2023] Open
Abstract
The Iso-Seq method of full-length cDNA sequencing is suitable to quantify differentially expressed genes (DEGs), transcripts (DETs) and transcript usage (DTU). However, the higher cost of Iso-Seq relative to RNA-seq has limited the comparison of both methods. Transcript abundance estimated by RNA-seq and deep Iso-Seq data for fetal liver from two cattle subspecies were compared to evaluate concordance. Inter-sample correlation of gene- and transcript-level abundance was higher within technology than between technologies. Identification of DEGs between the cattle subspecies depended on sequencing method with only 44 genes identified by both that included 6 novel genes annotated by Iso-Seq. There was a pronounced difference between Iso-Seq and RNA-seq results at transcript-level wherein Iso-Seq revealed several magnitudes more transcript abundance and usage differences between subspecies. Factors influencing DEG identification included size selection during Iso-Seq library preparation, average transcript abundance, multi-mapping of RNA-seq reads to the reference genome, and overlapping coordinates of genes. Some DEGs called by RNA-seq alone appear to be sequence duplication artifacts. Among the 44 DEGs identified by both technologies some play a role in immune system, thyroid function and cell growth. Iso-Seq revealed hidden transcriptional complexity in DEGs, DETs and DTU genes between cattle subspecies previously missed by RNA-seq.
Collapse
Affiliation(s)
- Yan Ren
- The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide, Roseworthy, Adelaide, SA, 5371, Australia
| | | | - Timothy P L Smith
- U.S. Meat Animal Research Center, USDA-ARS, Clay Center, Clay Center, Nebraska, USA
| | - Stefan Hiendleder
- The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide, Roseworthy, Adelaide, SA, 5371, Australia
- Robinson Research Institute, The University of Adelaide, North Adelaide, Adelaide, SA, 5006, Australia
| | - John L Williams
- The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide, Roseworthy, Adelaide, SA, 5371, Australia
- Department of Animal Science, Food and Nutrition, Università Cattolica del Sacro Cuore, 29122, Piacenza, Italy
| | - Wai Yee Low
- The Davies Research Centre, School of Animal and Veterinary Sciences, University of Adelaide, Roseworthy, Adelaide, SA, 5371, Australia.
| |
Collapse
|
47
|
Integrative Meta-Analysis of Huntington's Disease Transcriptome Landscape. Genes (Basel) 2022; 13:genes13122385. [PMID: 36553652 PMCID: PMC9777612 DOI: 10.3390/genes13122385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 11/24/2022] [Accepted: 12/12/2022] [Indexed: 12/23/2022] Open
Abstract
Huntington's disease (HD) is a neurodegenerative disorder with autosomal dominant inheritance caused by glutamine expansion in the Huntingtin gene (HTT). Striatal projection neurons (SPNs) in HD are more vulnerable to cell death. The executive striatal population is directly connected with the Brodmann Area (BA9), which is mainly involved in motor functions. Analyzing the disease samples from BA9 from the SRA database provides insights related to neuron degeneration, which helps to identify a promising therapeutic strategy. Most gene expression studies examine the changes in expression and associated biological functions. In this study, we elucidate the relationship between variants and their effect on gene/downstream transcript expression. We computed gene and transcript abundance and identified variants from RNA-seq data using various pipelines. We predicted the effect of genome-wide association studies (GWAS)/novel variants on regulatory functions. We found that many variants affect the histone acetylation pattern in HD, thereby perturbing the transcription factor networks. Interestingly, some variants affect miRNA binding as well as their downstream gene expression. Tissue-specific network analysis showed that mitochondrial, neuroinflammation, vasculature, and angiogenesis-related genes are disrupted in HD. From this integrative omics analysis, we propose that abnormal neuroinflammation acts as a two-edged sword that indirectly affects the vasculature and associated energy metabolism. Rehabilitation of blood-brain barrier functionality and energy metabolism may secure the neuron from cell death.
Collapse
|
48
|
Comparative meta-analysis of host transcriptional response during Streptococcus pneumoniae carriage or infection. Microb Pathog 2022; 173:105816. [DOI: 10.1016/j.micpath.2022.105816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 09/16/2022] [Accepted: 10/03/2022] [Indexed: 11/06/2022]
|
49
|
Makhnovskii PA, Gusev OA, Bokov RO, Gazizova GR, Vepkhvadze TF, Lysenko EA, Vinogradova OL, Kolpakov FA, Popov DV. Alternative transcription start sites contribute to acute-stress-induced transcriptome response in human skeletal muscle. Hum Genomics 2022; 16:24. [PMID: 35869513 PMCID: PMC9308330 DOI: 10.1186/s40246-022-00399-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 07/14/2022] [Indexed: 11/18/2022] Open
Abstract
Background More than half of human protein-coding genes have an alternative transcription start site (TSS). We aimed to investigate the contribution of alternative TSSs to the acute-stress-induced transcriptome response in human tissue (skeletal muscle) using the cap analysis of gene expression approach. TSSs were examined at baseline and during recovery after acute stress (a cycling exercise). Results We identified 44,680 CAGE TSS clusters (including 3764 first defined) belonging to 12,268 genes and annotated for the first time 290 TSSs belonging to 163 genes. The transcriptome dynamically changes during the first hours after acute stress; the change in the expression of 10% of genes was associated with the activation of alternative TSSs, indicating differential TSSs usage. The majority of the alternative TSSs do not increase proteome complexity suggesting that the function of thousands of alternative TSSs is associated with the fine regulation of mRNA isoform expression from a gene due to the transcription factor-specific activation of various alternative TSSs. We identified individual muscle promoter regions for each TSS using muscle open chromatin data (ATAC-seq and DNase-seq). Then, using the positional weight matrix approach we predicted time course activation of “classic” transcription factors involved in response of skeletal muscle to contractile activity, as well as diversity of less/un-investigated factors. Conclusions Transcriptome response induced by acute stress related to activation of the alternative TSSs indicates that differential TSSs usage is an essential mechanism of fine regulation of gene response to stress stimulus. A comprehensive resource of accurate TSSs and individual promoter regions for each TSS in muscle was created. This resource together with the positional weight matrix approach can be used to accurate prediction of TFs in any gene(s) of interest involved in the response to various stimuli, interventions or pathological conditions in human skeletal muscle. Supplementary Information The online version contains supplementary material available at 10.1186/s40246-022-00399-8.
Collapse
|
50
|
Castaldi PJ, Abood A, Farber CR, Sheynkman GM. Bridging the splicing gap in human genetics with long-read RNA sequencing: finding the protein isoform drivers of disease. Hum Mol Genet 2022; 31:R123-R136. [PMID: 35960994 PMCID: PMC9585682 DOI: 10.1093/hmg/ddac196] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 02/04/2023] Open
Abstract
Aberrant splicing underlies many human diseases, including cancer, cardiovascular diseases and neurological disorders. Genome-wide mapping of splicing quantitative trait loci (sQTLs) has shown that genetic regulation of alternative splicing is widespread. However, identification of the corresponding isoform or protein products associated with disease-associated sQTLs is challenging with short-read RNA-seq, which cannot precisely characterize full-length transcript isoforms. Furthermore, contemporary sQTL interpretation often relies on reference transcript annotations, which are incomplete. Solutions to these issues may be found through integration of newly emerging long-read sequencing technologies. Long-read sequencing offers the capability to sequence full-length mRNA transcripts and, in some cases, to link sQTLs to transcript isoforms containing disease-relevant protein alterations. Here, we provide an overview of sQTL mapping approaches, the use of long-read sequencing to characterize sQTL effects on isoforms, the linkage of RNA isoforms to protein-level functions and comment on future directions in the field. Based on recent progress, long-read RNA sequencing promises to be part of the human disease genetics toolkit to discover and treat protein isoforms causing rare and complex diseases.
Collapse
Affiliation(s)
- Peter J Castaldi
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
- Division of General Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital, Boston, MA 02115, USA
| | - Abdullah Abood
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
| | - Charles R Farber
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Public Health Sciences, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
| | - Gloria M Sheynkman
- Center for Public Health Genomics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Biochemistry and Molecular Genetics, School of Medicine, University of Virginia, Charlottesville, VA 22903, USA
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA 22903, USA
- UVA Comprehensive Cancer Center, University of Virginia, Charlottesville, VA 22903, USA
| |
Collapse
|