1
|
Yoon Y, Bournique E, Soles LV, Yin H, Chu HF, Yin C, Zhuang Y, Liu X, Liu L, Jeong J, Yu C, Valdez M, Tian L, Huang L, Shi X, Seelig G, Ding F, Tong L, Buisson R, Shi Y. RBBP6 anchors pre-mRNA 3' end processing to nuclear speckles for efficient gene expression. Mol Cell 2025; 85:555-570.e8. [PMID: 39798570 PMCID: PMC11805622 DOI: 10.1016/j.molcel.2024.12.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2024] [Revised: 11/01/2024] [Accepted: 12/16/2024] [Indexed: 01/15/2025]
Abstract
Pre-mRNA 3' processing is an integral step in mRNA biogenesis. However, where this process occurs in the nucleus remains unknown. Here, we demonstrate that nuclear speckles (NSs), membraneless organelles enriched with splicing factors, are major sites for pre-mRNA 3' processing in human cells. We show that the essential pre-mRNA 3' processing factor retinoblastoma-binding protein 6 (RBBP6) associates strongly with NSs via its C-terminal intrinsically disordered region (IDR). Importantly, although the conserved N-terminal domain (NTD) of RBBP6 is sufficient for pre-mRNA 3' processing in vitro, its IDR-mediated association with NSs is required for efficient pre-mRNA 3' processing in cells. Through proximity labeling analyses, we provide evidence that pre-mRNA 3' processing for over 50% of genes occurs near NSs. We propose that NSs serve as hubs for RNA polymerase II transcription, pre-mRNA splicing, and 3' processing, thereby enhancing the efficiency and coordination of different gene expression steps.
Collapse
Affiliation(s)
- Yoseop Yoon
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Elodie Bournique
- Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Lindsey V Soles
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Hong Yin
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA 92697, USA
| | - Hsu-Feng Chu
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Christopher Yin
- Department of Electrical and Computer Engineering, University of Washington, Seattle, Seattle, WA 98195, USA
| | - Yinyin Zhuang
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA 92697, USA
| | - Xiangyang Liu
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Liang Liu
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Joshua Jeong
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Clinton Yu
- Department of Physiology and Biophysics, University of California, Irvine, Irvine, CA 92697, USA
| | - Marielle Valdez
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Lusong Tian
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Lan Huang
- Department of Physiology and Biophysics, University of California, Irvine, Irvine, CA 92697, USA
| | - Xiaoyu Shi
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA 92697, USA; Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA 92697, USA; Department of Chemistry, University of California, Irvine, Irvine, CA 92697, USA
| | - Georg Seelig
- Department of Electrical and Computer Engineering, University of Washington, Seattle, Seattle, WA 98195, USA; Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Seattle, WA 98195, USA
| | - Fangyuan Ding
- Department of Biomedical Engineering, University of California, Irvine, Irvine, CA 92697, USA
| | - Liang Tong
- Department of Biological Sciences, Columbia University, New York, NY 10027, USA
| | - Rémi Buisson
- Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA
| | - Yongsheng Shi
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California, Irvine, Irvine, CA 92697, USA.
| |
Collapse
|
2
|
Yeganeh Markid T, Pourahmadiyan A, Hamzeh S, Sharifi-Bonab M, Asadi MR, Jalaiei A, Rezazadeh M, Ghafouri-Fard S. A special focus on polyadenylation and alternative polyadenylation in neurodegenerative diseases: A systematic review. J Neurochem 2025; 169:e16255. [PMID: 39556113 DOI: 10.1111/jnc.16255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2024] [Revised: 10/14/2024] [Accepted: 10/15/2024] [Indexed: 11/19/2024]
Abstract
Neurodegenerative diseases (NDDs) are one of the prevailing conditions characterized by progressive neuronal loss. Polyadenylation (PA) and alternative polyadenylation (APA) are the two main post-transcriptional events that regulate neuronal gene expression and protein production. This systematic review analyzed the available literature on the role of PA and APA in NDDs, with an emphasis on their contributions to disease development. A comprehensive literature search was performed using the PubMed, Scopus, Cochrane, Google Scholar, Embase, Web of Science, and ProQuest databases. The search strategy was developed based on the framework introduced by Arksey and O'Malley and supplemented by the inclusion and exclusion criteria. The study selection was performed by two independent reviewers. Extraction and data organization were performed in accordance with the predefined variables. Subsequently, quantitative and qualitative analyses were performed. Forty-seven studies were included, related to a variety of NDDs, namely Alzheimer's disease, Parkinson's disease, Huntington's disease, and amyotrophic lateral sclerosis. Disease induction was performed using different models, including human tissues, animal models, and cultured cells. Most investigations were related to PA, although some were related to APA or both. Amyloid precursor protein (APP), Tau, SNCA, and STMN2 were the major genes identified; most of the altered PA patterns were related to mRNA stability and translation efficiency. This review particularly underscores the key roles of PA and APA in the pathogenesis of NDDs through their mechanisms that contribute to gene expression dysregulation, protein aggregation, and neuronal dysfunction. Insights into these mechanisms may lead to new therapeutic strategies focused on the modulation of PA and APA activities. Further research is required to investigate the translational potential of targeting these pathways for NDD treatment.
Collapse
Affiliation(s)
- Tarlan Yeganeh Markid
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Azam Pourahmadiyan
- Cellular and Molecular Research Center, Basic Health Sciences Institute, Shahrekord University of Medical Sciences, Shahrekord, Iran
| | - Soroosh Hamzeh
- Student Research Committee, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Mirmohsen Sharifi-Bonab
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Mohamad Reza Asadi
- Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Abbas Jalaiei
- Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Maryam Rezazadeh
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
- Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Soudeh Ghafouri-Fard
- Department of Medical Genetics, Faculty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
3
|
Gao Y, Shaw VR, Amos CI. Alternative polyadenylation shapes the molecular and clinical features of lung adenocarcinoma. Hum Mol Genet 2025; 34:1-10. [PMID: 39487796 DOI: 10.1093/hmg/ddae150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 08/14/2024] [Indexed: 11/04/2024] Open
Abstract
Alternative polyadenylation (APA) is a major mechanism of post-transcriptional regulation that affects mRNA stability, localization and translation efficiency. Previous pan-cancer studies have revealed that APA is frequently disrupted in cancer and is associated with patient outcomes. Yet, little is known about cancer type-specific APA alterations. Here, we integrated RNA-sequencing data from a Korean cohort (GEO: GSE40419) and The Cancer Genome Atlas (TCGA) to comprehensively analyze APA alterations in lung adenocarcinomas (LUADs). Comparing expression levels of core genes involved in polyadenylation, we find that overall, the set of 28 of 31 genes are upregulated, with CSTF2 particularly upregulated. We observed broad and recurrent APA changes in LUAD growth-promoting genes. In addition, we find enrichment of APA events in genes associated with known LUAD pathways and an increased heterogeneity in polyadenylation (polyA) site usage of proliferation-associated genes. Upon further investigation, we report smoking-specific APA changes are also highly relevant to LUAD development. Overall, our in-depth analysis reveals APA as an important driver for the molecular and clinical features of lung adenocarcinoma.
Collapse
Affiliation(s)
- Yipeng Gao
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
- Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - Vikram R Shaw
- Graduate Program in Quantitative and Computational Biosciences, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| | - Christopher I Amos
- Department of Medicine, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, United States
| |
Collapse
|
4
|
Ou J, Liu H, Park S, Green MR, Zhu LJ. InPAS: An R/Bioconductor Package for Identifying Novel Polyadenylation Sites and Alternative Polyadenylation from Bulk RNA-seq Data. Front Biosci (Schol Ed) 2024; 16:21. [PMID: 39736014 DOI: 10.31083/j.fbs1604021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 09/20/2024] [Accepted: 10/10/2024] [Indexed: 12/31/2024]
Abstract
BACKGROUND Alternative cleavage and polyadenylation (APA) is a crucial post-transcriptional gene regulation mechanism that regulates gene expression in eukaryotes by increasing the diversity and complexity of both the transcriptome and proteome. Despite the development of more than a dozen experimental methods over the last decade to identify and quantify APA events, widespread adoption of these methods has been limited by technical, financial, and time constraints. Consequently, APA remains poorly understood in most eukaryotes. However, RNA sequencing (RNA-seq) technology has revolutionized transcriptome profiling and recent studies have shown that RNA-seq data can be leveraged to identify and quantify APA events. RESULTS To fully capitalize on the exponentially growing RNA-seq data, we developed InPAS (Identification of Novel alternative PolyAdenylation Sites), an R/Bioconductor package for accurate identification of novel and known cleavage and polyadenylation sites (CPSs), as well as quantification of APA from RNA-seq data of various experimental designs. Compared to other APA analysis tools, InPAS offers several important advantages, including the ability to detect both novel proximal and distal CPSs, to fine tune positions of CPSs using a naïve Bayes classifier based on flanking sequence features, and to identify APA events from RNA-seq data of complex experimental designs using linear models. We benchmarked the performance of InPAS and other leading tools using simulated and experimental RNA-seq data with matched 3'-end RNA-seq data. Our results reveal that InPAS frequently outperforms existing tools in terms of precision, sensitivity, and specificity. Furthermore, we demonstrate its scalability and versatility by applying it to large, diverse RNA-seq datasets. CONCLUSIONS InPAS is an efficient and robust tool for identifying and quantifying APA events using readily accessible conventional RNA-seq data. Its versatility opens doors to explore APA regulation across diverse eukaryotic systems with various experimental designs. We believe that InPAS will drive APA research forward, deepening our understanding of its role in regulating gene expression, and potentially leading to the discovery of biomarkers or therapeutics for diseases.
Collapse
Affiliation(s)
- Jianhong Ou
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
- Regeneration Center, Duke University School of Medicine, Duke University, Durham, NC 27701, USA
| | - Haibo Liu
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Sungmi Park
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Michael R Green
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Lihua Julie Zhu
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
- Department of Molecular Medicine, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
- Department of Genomics and Computational Biology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| |
Collapse
|
5
|
Zhang X, Liu F, Zhou Y. Coupling of alternative splicing and alternative polyadenylation. Acta Biochim Biophys Sin (Shanghai) 2024; 57:22-32. [PMID: 39632657 PMCID: PMC11802343 DOI: 10.3724/abbs.2024211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Accepted: 10/28/2024] [Indexed: 12/07/2024] Open
Abstract
RNA splicing and 3'-cleavage and polyadenylation (CPA) are essential processes for the maturation of RNA. There have been extensive independent studies of these regulated processing events, including alternative splicing (AS) and alternative polyadenylation (APA). However, growing evidence suggests potential crosstalk between splicing and 3'-end processing in regulating AS or APA. Here, we first provide a brief overview of the molecular machines involved in splicing and 3'-end processing events, and then review recent studies on the functions and mechanisms of the crosstalk between the two processes. On the one hand, 3'-end processing can affect splicing, as 3'-end processing factors and CPA-generated polyA tail promote the splicing of the last intron. Beyond that, 3'-end processing factors can also influence the splicing of internal and terminal exons. Those 3'-end processing factors can also interact with different RNA-binding proteins (RBPs) to exert their effects on AS. The length of 3' untranslated region (3' UTR) can affect the splicing of upstream exons. On the other hand, splicing and CPA may compete within introns in generating different products. Furthermore, splicing within the 3' UTR is a significant factor contributing to 3' UTR diversity. Splicing also influences 3'-end processing through the actions of certain splicing factors. Interestingly, some classical RBPs play dual roles in both splicing and 3'-end processing. Finally, we discuss how long-read sequencing technologies aid in understanding the coordination of AS-APA events and envision that these findings may potentially promote the development of new strategies for disease diagnosis and treatment.
Collapse
Affiliation(s)
- Xueying Zhang
- College of Life SciencesTaiKang Center for Life and Medical SciencesHubei Key Laboratory of Cell HomeostasisRNA InstituteWuhan UniversityWuhan430072China
| | - Feiyan Liu
- College of Life SciencesTaiKang Center for Life and Medical SciencesHubei Key Laboratory of Cell HomeostasisRNA InstituteWuhan UniversityWuhan430072China
| | - Yu Zhou
- College of Life SciencesTaiKang Center for Life and Medical SciencesHubei Key Laboratory of Cell HomeostasisRNA InstituteWuhan UniversityWuhan430072China
- Frontier Science Center for Immunology and MetabolismWuhan UniversityWuhan430072China
| |
Collapse
|
6
|
Du C, Fan W, Zhou Y. Integrated Biochemical and Computational Methods for Deciphering RNA-Processing Codes. WILEY INTERDISCIPLINARY REVIEWS. RNA 2024; 15:e1875. [PMID: 39523464 DOI: 10.1002/wrna.1875] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 09/23/2024] [Accepted: 10/21/2024] [Indexed: 11/16/2024]
Abstract
RNA processing involves steps such as capping, splicing, polyadenylation, modification, and nuclear export. These steps are essential for transforming genetic information in DNA into proteins and contribute to RNA diversity and complexity. Many biochemical methods have been developed to profile and quantify RNAs, as well as to identify the interactions between RNAs and RNA-binding proteins (RBPs), especially when coupled with high-throughput sequencing technologies. With the rapid accumulation of diverse data, it is crucial to develop computational methods to convert the big data into biological knowledge. In particular, machine learning and deep learning models are commonly utilized to learn the rules or codes governing the transformation from DNA sequences to intriguing RNAs based on manually designed or automatically extracted features. When precise enough, the RNA codes can be incredibly useful for predicting RNA products, decoding the molecular mechanisms, forecasting the impact of disease variants on RNA processing events, and identifying driver mutations. In this review, we systematically summarize the biochemical and computational methods for deciphering five important RNA codes related to alternative splicing, alternative polyadenylation, RNA localization, RNA modifications, and RBP binding. For each code, we review the main types of experimental methods used to generate training data, as well as the key features, strategic model structures, and advantages of representative tools. We also discuss the challenges encountered in developing predictive models using large language models and extensive domain knowledge. Additionally, we highlight useful resources and propose ways to improve computational tools for studying RNA codes.
Collapse
Affiliation(s)
- Chen Du
- College of Life Sciences, TaiKang Center for Life and Medical Sciences, RNA Institute, Wuhan University, Wuhan, China
| | - Weiliang Fan
- College of Life Sciences, TaiKang Center for Life and Medical Sciences, RNA Institute, Wuhan University, Wuhan, China
| | - Yu Zhou
- College of Life Sciences, TaiKang Center for Life and Medical Sciences, RNA Institute, Wuhan University, Wuhan, China
- Frontier Science Center for Immunology and Metabolism, Wuhan University, Wuhan, China
- State Key Laboratory of Virology, Wuhan University, Wuhan, China
| |
Collapse
|
7
|
Aygün N, Vuong C, Krupa O, Mory J, Le BD, Valone JM, Liang D, Shafie B, Zhang P, Salinda A, Wen C, Gandal MJ, Love MI, de la Torre-Ubieta L, Stein JL. Genetics of cell-type-specific post-transcriptional gene regulation during human neurogenesis. Am J Hum Genet 2024; 111:1877-1898. [PMID: 39168119 PMCID: PMC11393701 DOI: 10.1016/j.ajhg.2024.07.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 07/18/2024] [Accepted: 07/23/2024] [Indexed: 08/23/2024] Open
Abstract
The function of some genetic variants associated with brain-relevant traits has been explained through colocalization with expression quantitative trait loci (eQTL) conducted in bulk postmortem adult brain tissue. However, many brain-trait associated loci have unknown cellular or molecular function. These genetic variants may exert context-specific function on different molecular phenotypes including post-transcriptional changes. Here, we identified genetic regulation of RNA editing and alternative polyadenylation (APA) within a cell-type-specific population of human neural progenitors and neurons. More RNA editing and isoforms utilizing longer polyadenylation sequences were observed in neurons, likely due to higher expression of genes encoding the proteins mediating these post-transcriptional events. We also detected hundreds of cell-type-specific editing quantitative trait loci (edQTLs) and alternative polyadenylation QTLs (apaQTLs). We found colocalizations of a neuron edQTL in CCDC88A with educational attainment and a progenitor apaQTL in EP300 with schizophrenia, suggesting that genetically mediated post-transcriptional regulation during brain development leads to differences in brain function.
Collapse
Affiliation(s)
- Nil Aygün
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Celine Vuong
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Oleh Krupa
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jessica Mory
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Brandon D Le
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jordan M Valone
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Dan Liang
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Beck Shafie
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Pan Zhang
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Angelo Salinda
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Cindy Wen
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Michael J Gandal
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Michael I Love
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Luis de la Torre-Ubieta
- Intellectual and Developmental Disabilities Research Center, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Semel Institute for Neuroscience and Human Behavior, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA; Department of Psychiatry and Biobehavioral Sciences, Semel Institute, David Geffen School of Medicine University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Jason L Stein
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA; UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| |
Collapse
|
8
|
Holmes ME, Hertel KJ. Interdependent regulation of alternative splicing by SR and hnRNP proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.19.608666. [PMID: 39229091 PMCID: PMC11370404 DOI: 10.1101/2024.08.19.608666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
Alternative pre-mRNA splicing is a combinatorial process involving SR and hnRNP splicing factors. These proteins can silence or enhance splicing based on their expression levels and binding positions. To better understand their combinatorial and interdependent regulation, computational analyses were performed using HepG2 and K562 cell knockdown and binding datasets from the ENCODE Project. Analyses of diMerential splicing for 6 SR proteins and 13 hnRNP knockdowns revealed statistically significant exon overlap among most RBP combinations, albeit at diMerent levels. Neither SR proteins nor hnRNPs showed strong preferences for collaborating with specific RBP classes in mediating exon inclusion. While SRSF1, hnRNPK, and hnRNPC stand out as major influencers of alternative splicing, they do so predominantly independent of other RBPs. Meanwhile, minor influencers of alternative splicing such as hnRNPAB and hnRNPA0 predominantly regulate exon inclusion in concert with other RBPs, indicating that inclusion can be mediated by both single and multiple RBPs. Interestingly, the higher the number of RBPs that regulate the inclusion of an exon, the more variable exon inclusion preferences become. Interdependently regulated exons are more modular and have diMerent physical characteristics such as reduced exon length compared to their independent counterparts. A comparison of RBP interdependence between HepG2 and K562 cells provides the framework that explains cell-type-specific alternative splicing. Our study highlights the importance of the interdependent regulation of alternative exons and identifies characteristics of interdependently regulated exons that diMer from independently regulated exons.
Collapse
|
9
|
Liu L, Manley JL. Modulation of diverse biological processes by CPSF, the master regulator of mRNA 3' ends. RNA (NEW YORK, N.Y.) 2024; 30:1122-1140. [PMID: 38986572 PMCID: PMC11331416 DOI: 10.1261/rna.080108.124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 06/27/2024] [Indexed: 07/12/2024]
Abstract
The cleavage and polyadenylation specificity factor (CPSF) complex plays a central role in the formation of mRNA 3' ends, being responsible for the recognition of the poly(A) signal sequence, the endonucleolytic cleavage step, and recruitment of poly(A) polymerase. CPSF has been extensively studied for over three decades, and its functions and those of its individual subunits are becoming increasingly well-defined, with much current research focusing on the impact of these proteins on the normal functioning or disease/stress states of cells. In this review, we provide an overview of the general functions of CPSF and its subunits, followed by a discussion of how they exert their functions in a surprisingly diverse variety of biological processes and cellular conditions. These include transcription termination, small RNA processing, and R-loop prevention/resolution, as well as more generally cancer, differentiation/development, and infection/immunity.
Collapse
Affiliation(s)
- Lizhi Liu
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | - James L Manley
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| |
Collapse
|
10
|
Soles LV, Liu L, Zou X, Yoon Y, Li S, Tian L, Valdez MC, Yu A, Yin H, Li W, Ding F, Seelig G, Li L, Shi Y. A nuclear RNA degradation code for eukaryotic transcriptome surveillance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.23.604837. [PMID: 39211185 PMCID: PMC11361069 DOI: 10.1101/2024.07.23.604837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
The RNA exosome plays critical roles in eukaryotic RNA degradation, but it remains unclear how the exosome specifically recognizes its targets. The PAXT connection is an adaptor that recruits the exosome to polyadenylated RNAs in the nucleus, especially transcripts polyadenylated at intronic poly(A) sites. Here we show that PAXT-mediated RNA degradation is induced by the combination of a 5' splice site and a poly(A) junction, but not by either sequence alone. These sequences are bound by U1 snRNP and cleavage/polyadenylation factors, which in turn cooperatively recruit PAXT. As the 5' splice site-poly(A) junction combination is typically not found on correctly processed full-length RNAs, we propose that it functions as a "nuclear RNA degradation code" (NRDC). Importantly, disease-associated single nucleotide polymorphisms that create novel 5' splice sites in 3' untranslated regions can induce aberrant mRNA degradation via the NRDC mechanism. Together our study identified the first NRDC, revealed its recognition mechanism, and characterized its role in human diseases.
Collapse
|
11
|
Yang X, Chen X, Liu C, Wang Z, Lei W, Li Q, Zhao Y, Wang X. Dynamic Alternative Polyadenylation during Litopenaeus Vannamei Metamorphosis Development. Genes (Basel) 2024; 15:837. [PMID: 39062616 PMCID: PMC11275414 DOI: 10.3390/genes15070837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Revised: 06/21/2024] [Accepted: 06/24/2024] [Indexed: 07/28/2024] Open
Abstract
As an important mechanism in the post-transcriptional regulation of eukaryotic gene expression, alternative polyadenylation (APA) plays a key role in biological processes such as cell proliferation and differentiation. However, the role and dynamic pattern of APA during Litopenaeus vannamei metamorphosis are poorly understood. Here, RNA-seq data covering from the embryo to the maturation (16 time points) of L. vannamei were utilized. We identified 247 differentially expressed APA events between early and adult stages, and through fuzzy mean clustering analysis, we discovered five dynamic APA patterns. Among them, the gradual elongation of the 3'UTR is the major APA pattern that changes over time, and its genes are enriched in the pathways of protein and energy metabolism. Finally, we constructed mRNA-miRNA and PPI networks and detected several central miRNAs that may regulate L. vannamei development. Our results revealed the complex APA mechanisms in L. vannamei metamorphosis, shedding new light on post-transcriptional regulation of crustacean metamorphosis.
Collapse
Affiliation(s)
- Xueqin Yang
- China (Guangxi)-ASEAN Key Laboratory of Comprehensive Exploitation and Utilization of Aquatic Germplasm Resources, Ministry of Agriculture and Rural Affairs, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (X.Y.); (X.C.)
- College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China;
| | - Xiuli Chen
- China (Guangxi)-ASEAN Key Laboratory of Comprehensive Exploitation and Utilization of Aquatic Germplasm Resources, Ministry of Agriculture and Rural Affairs, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (X.Y.); (X.C.)
- Key Laboratory of Aquaculture Genetic and Breeding and Healthy Aquaculture of Guangxi, Guangxi Academy of Fishery Sciences, Nanning 530021, China;
| | - Chengzhang Liu
- CAS Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao 266071, China;
| | - Zezhong Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China;
| | - Wei Lei
- Department of Pharmaceutical and Graduate Life Sciences, College of Pharmacy, Natural & Health Sciences, Manchester University, Fort Wayne, IN 46845, USA;
| | - Qiangyong Li
- Key Laboratory of Aquaculture Genetic and Breeding and Healthy Aquaculture of Guangxi, Guangxi Academy of Fishery Sciences, Nanning 530021, China;
| | - Yongzhen Zhao
- China (Guangxi)-ASEAN Key Laboratory of Comprehensive Exploitation and Utilization of Aquatic Germplasm Resources, Ministry of Agriculture and Rural Affairs, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (X.Y.); (X.C.)
- Key Laboratory of Aquaculture Genetic and Breeding and Healthy Aquaculture of Guangxi, Guangxi Academy of Fishery Sciences, Nanning 530021, China;
| | - Xia Wang
- China (Guangxi)-ASEAN Key Laboratory of Comprehensive Exploitation and Utilization of Aquatic Germplasm Resources, Ministry of Agriculture and Rural Affairs, Guangxi Academy of Fishery Sciences, Nanning 530021, China; (X.Y.); (X.C.)
- College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China;
| |
Collapse
|
12
|
Patowary A, Zhang P, Jops C, Vuong CK, Ge X, Hou K, Kim M, Gong N, Margolis M, Vo D, Wang X, Liu C, Pasaniuc B, Li JJ, Gandal MJ, de la Torre-Ubieta L. Developmental isoform diversity in the human neocortex informs neuropsychiatric risk mechanisms. Science 2024; 384:eadh7688. [PMID: 38781356 DOI: 10.1126/science.adh7688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 03/13/2024] [Indexed: 05/25/2024]
Abstract
RNA splicing is highly prevalent in the brain and has strong links to neuropsychiatric disorders; yet, the role of cell type-specific splicing and transcript-isoform diversity during human brain development has not been systematically investigated. In this work, we leveraged single-molecule long-read sequencing to deeply profile the full-length transcriptome of the germinal zone and cortical plate regions of the developing human neocortex at tissue and single-cell resolution. We identified 214,516 distinct isoforms, of which 72.6% were novel (not previously annotated in Gencode version 33), and uncovered a substantial contribution of transcript-isoform diversity-regulated by RNA binding proteins-in defining cellular identity in the developing neocortex. We leveraged this comprehensive isoform-centric gene annotation to reprioritize thousands of rare de novo risk variants and elucidate genetic risk mechanisms for neuropsychiatric disorders.
Collapse
Affiliation(s)
- Ashok Patowary
- Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Intellectual and Developmental Disabilities Research Center, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Pan Zhang
- Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Intellectual and Developmental Disabilities Research Center, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Connor Jops
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Lifespan Brain Institute at Penn Med and the Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Celine K Vuong
- Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Intellectual and Developmental Disabilities Research Center, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Xinzhou Ge
- Department of Statistics, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Kangcheng Hou
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Minsoo Kim
- Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Naihua Gong
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Michael Margolis
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Daniel Vo
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Lifespan Brain Institute at Penn Med and the Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Xusheng Wang
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38103, USA
- Center for Proteomics and Metabolomics, St. Jude Children's Research Hospital, Memphis, TN 38105, USA
| | - Chunyu Liu
- Department of Psychiatry, SUNY Upstate Medical University, Syracuse, NY 13210, USA
- Center for Medical Genetics and Hunan Key Laboratory of Medical Genetics, School of Life Sciences, Central South University, Changsha, Hunan 410008, China
| | - Bogdan Pasaniuc
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Institute for Precision Health, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Jingyi Jessica Li
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Statistics, University of California Los Angeles, Los Angeles, CA 90095, USA
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Biostatistics, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Michael J Gandal
- Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Intellectual and Developmental Disabilities Research Center, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Lifespan Brain Institute at Penn Med and the Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Luis de la Torre-Ubieta
- Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
- Intellectual and Developmental Disabilities Research Center, Semel Institute for Neuroscience and Human Behavior, University of California Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
13
|
Liu S, Luo S, Yang D, Huang J, Jiang X, Yu S, Fu J, Zhou D, Chen X, He H, Fu H. Alternative polyadenylation profiles of susceptible and resistant rice (Oryza sativa L.) in response to bacterial leaf blight using RNA-seq. BMC PLANT BIOLOGY 2024; 24:145. [PMID: 38413866 PMCID: PMC10900630 DOI: 10.1186/s12870-024-04839-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2023] [Accepted: 02/20/2024] [Indexed: 02/29/2024]
Abstract
BACKGROUND Alternative polyadenylation (APA) is an important pattern of post-transcriptional regulation of genes widely existing in eukaryotes, involving plant physiological and pathological processes. However, there is a dearth of studies investigating the role of APA profile in rice leaf blight. RESULTS In this study, we compared the APA profile of leaf blight-susceptible varieties (CT 9737-613P-M) and resistant varieties (NSIC RC154) following bacterial blight infection. Through gene enrichment analysis, we found that the genes of two varieties typically exhibited distal poly(A) (PA) sites that play different roles in two kinds of rice, indicating differential APA regulatory mechanisms. In this process, many disease-resistance genes displayed multiple transcripts via APA. Moreover, we also found five polyadenylation factors of similar expression patterns of rice, highlighting the critical roles of these five factors in rice response to leaf blight about PA locus diversity. CONCLUSION Notably, the present study provides the first dynamic changes of APA in rice in early response to biotic stresses and proposes a possible functional conjecture of APA in plant immune response, which lays the theoretical foundation for in-depth determination of the role of APA events in plant stress response and other life processes.
Collapse
Affiliation(s)
- Shaochun Liu
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Shuqi Luo
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Dewei Yang
- Institute of Rice, Fujian Academy of Agricultural Sciences, Fuzhou, China
| | - Junying Huang
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Xinlei Jiang
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Shangwei Yu
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Junru Fu
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Dahu Zhou
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Xiaorong Chen
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China
| | - Haohua He
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China.
| | - Haihui Fu
- Key Laboratory of Crop Physiology, Ecology, and Genetic Breeding, Ministry of Education, Jiangxi Agricultural University, Nanchang, 330045, China.
| |
Collapse
|
14
|
Jonnakuti VS, Wagner EJ, Maletić-Savatić M, Liu Z, Yalamanchili HK. PolyAMiner-Bulk is a deep learning-based algorithm that decodes alternative polyadenylation dynamics from bulk RNA-seq data. CELL REPORTS METHODS 2024; 4:100707. [PMID: 38325383 PMCID: PMC10921021 DOI: 10.1016/j.crmeth.2024.100707] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/13/2023] [Accepted: 01/11/2024] [Indexed: 02/09/2024]
Abstract
Alternative polyadenylation (APA) is a key post-transcriptional regulatory mechanism; yet, its regulation and impact on human diseases remain understudied. Existing bulk RNA sequencing (RNA-seq)-based APA methods predominantly rely on predefined annotations, severely impacting their ability to decode novel tissue- and disease-specific APA changes. Furthermore, they only account for the most proximal and distal cleavage and polyadenylation sites (C/PASs). Deconvoluting overlapping C/PASs and the inherent noisy 3' UTR coverage in bulk RNA-seq data pose additional challenges. To overcome these limitations, we introduce PolyAMiner-Bulk, an attention-based deep learning algorithm that accurately recapitulates C/PAS sequence grammar, resolves overlapping C/PASs, captures non-proximal-to-distal APA changes, and generates visualizations to illustrate APA dynamics. Evaluation on multiple datasets strongly evinces the performance merit of PolyAMiner-Bulk, accurately identifying more APA changes compared with other methods. With the growing importance of APA and the abundance of bulk RNA-seq data, PolyAMiner-Bulk establishes a robust paradigm of APA analysis.
Collapse
Affiliation(s)
- Venkata Soumith Jonnakuti
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Program in Quantitative and Computational Biology, Baylor College of Medicine, Houston, TX 77030, USA; Medical Scientist Training Program, Baylor College of Medicine, Houston, TX 77030, USA
| | - Eric J Wagner
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Mirjana Maletić-Savatić
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Zhandong Liu
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Program in Quantitative and Computational Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Hari Krishna Yalamanchili
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; USDA/ARS Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA.
| |
Collapse
|
15
|
Omidsalar AA, McCullough CG, Xu L, Boedijono S, Gerke D, Webb MG, Manojlovic Z, Sequeira A, Lew MF, Santorelli M, Serrano GE, Beach TG, Limon A, Vawter MP, Hjelm BE. Common mitochondrial deletions in RNA-Seq: evaluation of bulk, single-cell, and spatial transcriptomic datasets. Commun Biol 2024; 7:200. [PMID: 38368460 PMCID: PMC10874445 DOI: 10.1038/s42003-024-05877-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 01/31/2024] [Indexed: 02/19/2024] Open
Abstract
Common mitochondrial DNA (mtDNA) deletions are large structural variants in the mitochondrial genome that accumulate in metabolically active tissues with age and have been investigated in various diseases. We applied the Splice-Break2 pipeline (designed for high-throughput quantification of mtDNA deletions) to human RNA-Seq datasets and describe the methodological considerations for evaluating common deletions in bulk, single-cell, and spatial transcriptomics datasets. A robust evaluation of 1570 samples from 14 RNA-Seq studies showed: (i) the abundance of some common deletions detected in PCR-amplified mtDNA correlates with levels observed in RNA-Seq data; (ii) RNA-Seq library preparation method has a strong effect on deletion detection; (iii) deletions had a significant, positive correlation with age in brain and muscle; (iv) deletions were enriched in cortical grey matter, specifically in layers 3 and 5; and (v) brain regions with dopaminergic neurons (i.e., substantia nigra, ventral tegmental area, and caudate nucleus) had remarkable enrichment of common mtDNA deletions.
Collapse
Affiliation(s)
- Audrey A Omidsalar
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Carmel G McCullough
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Lili Xu
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Stanley Boedijono
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Daniel Gerke
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Michelle G Webb
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Zarko Manojlovic
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Adolfo Sequeira
- Department of Psychiatry and Human Behavior, University of California - Irvine (UCI) School of Medicine, Irvine, CA, USA
| | - Mark F Lew
- Department of Neurology, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Marco Santorelli
- Department of Stem Cell Biology and Regenerative Medicine, Keck School of Medicine of USC, Los Angeles, CA, USA
| | - Geidy E Serrano
- Banner Sun Health Research Institute (BSHRI), Sun City, AZ, USA
| | - Thomas G Beach
- Banner Sun Health Research Institute (BSHRI), Sun City, AZ, USA
| | - Agenor Limon
- Mitchell Center for Neurodegenerative Diseases, Department of Neurology, School of Medicine, University of Texas Medical Branch, Galveston, TX, USA
| | - Marquis P Vawter
- Department of Psychiatry and Human Behavior, University of California - Irvine (UCI) School of Medicine, Irvine, CA, USA
| | - Brooke E Hjelm
- Department of Translational Genomics, Keck School of Medicine of USC, Los Angeles, CA, USA.
| |
Collapse
|
16
|
Meng X, Li C, Hei Y, Zhou X, Zhou G. Comparative alternative polyadenylation profiles in differentiated adipocytes of subcutaneous and intramuscular fat tissue in cattle. Gene 2024; 894:147949. [PMID: 37918547 DOI: 10.1016/j.gene.2023.147949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 09/16/2023] [Accepted: 10/30/2023] [Indexed: 11/04/2023]
Abstract
Alternative polyadenylation (APA) is a key molecular mechanism involved in the post-transcriptional regulation of gene expression, which has been proven to play a critical role in cell differentiation. In the present study, we performed IVT-SAPAS sequencing to profile the dynamic changes of APA sites in bovine subcutaneous preadipocytes and intramuscular preadipocytes during adipogenesis. A total of 52621 high quality APA sites were identified in preadipocytes and adipocytes. Compared with preadipocytes, the increased usage of canonical AATAAA was observed in the cell-biased APA sites of adipocytes. Furthermore, 1933 and 2140 differentially expressed APA (DE-APA) sites, as well as 341 and 337 untranslated region-APA (UTR-APA) switching genes were identified in subcutaneous preadipocytes and intramuscular preadipocytes during adipogenesis, respectively. The UTR-APA switching genes showed divergent trends in preadipocytes, among which UTR-APA switching genes in intramuscular preadipocytes tended to use shorter 3'UTR for differentiation into mature adipocytes. APA events mediated by UTR-APA switching in intramuscular adipocytes were enriched in lipid synthesis and adipocyte differentiation. TRIB3, WWTR1, and INSIG1 played important roles in the differentiation of intramuscular preadipocytes. Briefly, our results provided new insights into understanding the mechanisms of bovine adipocyte differentiation.
Collapse
Affiliation(s)
- Xiangge Meng
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Key Laboratory of Swine Genetics and Breeding of Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Chengping Li
- College of Life Science, Liaocheng University, Liaocheng, China
| | - Yu Hei
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Key Laboratory of Swine Genetics and Breeding of Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Xiang Zhou
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction of Ministry of Education, Key Laboratory of Swine Genetics and Breeding of Ministry of Agriculture, College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, China; Hubei Hongshan Laboratory, Wuhan, China; Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Shenzhen, China; Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Guoli Zhou
- College of Life Science, Liaocheng University, Liaocheng, China.
| |
Collapse
|
17
|
Ge Y, Huang J, Chen R, Fu Y, Ling T, Ou X, Rong X, Cheng Y, Lin Y, Zhou F, Lu C, Yuan S, Xu A. Downregulation of CPSF6 leads to global mRNA 3' UTR shortening and enhanced antiviral immune responses. PLoS Pathog 2024; 20:e1012061. [PMID: 38416782 PMCID: PMC10927093 DOI: 10.1371/journal.ppat.1012061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 03/11/2024] [Accepted: 02/19/2024] [Indexed: 03/01/2024] Open
Abstract
Alternative polyadenylation (APA) is a widespread mechanism of gene regulation that generates mRNA isoforms with alternative 3' untranslated regions (3' UTRs). Our previous study has revealed the global 3' UTR shortening of host mRNAs through APA upon viral infection. However, how the dynamic changes in the APA landscape occur upon viral infection remains largely unknown. Here we further found that, the reduced protein abundance of CPSF6, one of the core 3' processing factors, promotes the usage of proximal poly(A) sites (pPASs) of many immune related genes in macrophages and fibroblasts upon viral infection. Shortening of the 3' UTR of these transcripts may improve their mRNA stability and translation efficiency, leading to the promotion of type I IFN (IFN-I) signalling-based antiviral immune responses. In addition, dysregulated expression of CPSF6 is also observed in many immune related physiological and pathological conditions, especially in various infections and cancers. Thus, the global APA dynamics of immune genes regulated by CPSF6, can fine-tune the antiviral response as well as the responses to other cellular stresses to maintain the tissue homeostasis, which may represent a novel regulatory mechanism for antiviral immunity.
Collapse
Affiliation(s)
- Yong Ge
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Jingrong Huang
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Rong Chen
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Yonggui Fu
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - Tao Ling
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Xin Ou
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - Xiaohui Rong
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Youxiang Cheng
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Yi Lin
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Fengyi Zhou
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - Chuanjian Lu
- The Second Clinical College of Guangzhou University of Chinese Medicine, Guangzhou, China
- State Key Laboratory of Dampness Syndrome of Chinese Medicine, Guangdong-Hong Kong-Macau Joint Lab on Chinese Medicine and Immune Disease Research, The Second Affiliated Hospital of Guangzhou University of Chinese Medicine, Guangdong Provincial Academy of Chinese Medical Sciences, Guangzhou, China
| | - Shaochun Yuan
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, China
| | - Anlong Xu
- Guangdong Province Key Laboratory of Pharmaceutical Functional Genes, MOE Key Laboratory of Gene Function and Regulation, State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- School of Life Sciences, Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|
18
|
Lee S, Aubee JI, Lai EC. Regulation of alternative splicing and polyadenylation in neurons. Life Sci Alliance 2023; 6:e202302000. [PMID: 37793776 PMCID: PMC10551640 DOI: 10.26508/lsa.202302000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Revised: 09/22/2023] [Accepted: 09/25/2023] [Indexed: 10/06/2023] Open
Abstract
Cell-type-specific gene expression is a fundamental feature of multicellular organisms and is achieved by combinations of regulatory strategies. Although cell-restricted transcription is perhaps the most widely studied mechanism, co-transcriptional and post-transcriptional processes are also central to the spatiotemporal control of gene functions. One general category of expression control involves the generation of multiple transcript isoforms from an individual gene, whose balance and cell specificity are frequently tightly regulated via diverse strategies. The nervous system makes particularly extensive use of cell-specific isoforms, specializing the neural function of genes that are expressed more broadly. Here, we review regulatory strategies and RNA-binding proteins that direct neural-specific isoform processing. These include various classes of alternative splicing and alternative polyadenylation events, both of which broadly diversify the neural transcriptome. Importantly, global alterations of splicing and alternative polyadenylation are characteristic of many neural pathologies, and recent genetic studies demonstrate how misregulation of individual neural isoforms can directly cause mutant phenotypes.
Collapse
Affiliation(s)
- Seungjae Lee
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA
| | - Joseph I Aubee
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA
| | - Eric C Lai
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, USA
| |
Collapse
|
19
|
Bryce-Smith S, Burri D, Gazzara MR, Herrmann CJ, Danecka W, Fitzsimmons CM, Wan YK, Zhuang F, Fansler MM, Fernández JM, Ferret M, Gonzalez-Uriarte A, Haynes S, Herdman C, Kanitz A, Katsantoni M, Marini F, McDonnel E, Nicolet B, Poon CL, Rot G, Schärfen L, Wu PJ, Yoon Y, Barash Y, Zavolan M. Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data. RNA (NEW YORK, N.Y.) 2023; 29:1839-1855. [PMID: 37816550 PMCID: PMC10653393 DOI: 10.1261/rna.079849.123] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 09/21/2023] [Indexed: 10/12/2023]
Abstract
The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, limitations, and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3'-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for continuous extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies, while the containers and reproducible workflows could easily be deployed and extended to evaluate new methods or data sets.
Collapse
Affiliation(s)
- Sam Bryce-Smith
- Department of Neuromuscular Diseases, UCL Queen Square Motor Neuron Disease Centre, UCL Queen Square Institute of Neurology, UCL, London WC1N 3BG, United Kingdom
| | - Dominik Burri
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Matthew R Gazzara
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Christina J Herrmann
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Weronika Danecka
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3FF, United Kingdom
| | - Christina M Fitzsimmons
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Yuk Kei Wan
- Genome Institute of Singapore, Buona Vista, Singapore 138672
- Yong Loo Lin School of Medicine, National University of Singapore, Kent Ridge, Singapore 119228
| | - Farica Zhuang
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Mervin M Fansler
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Graduate Studies, New York, New York 10065, USA
- Cancer Biology and Genetics, Sloan-Kettering Institute, MSKCC, New York, New York 10065, USA
| | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), 28029 Madrid, Spain
| | - Meritxell Ferret
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), 28029 Madrid, Spain
| | - Asier Gonzalez-Uriarte
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), 28029 Madrid, Spain
| | - Samuel Haynes
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3FF, United Kingdom
| | - Chelsea Herdman
- Department of Neurobiology, University of Utah, Salt Lake City, Utah 84132, USA
| | - Alexander Kanitz
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Maria Katsantoni
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg-University Mainz, 55118 Mainz, Germany
| | - Euan McDonnel
- Leeds Institute for Data Analytics, School of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9NL, United Kingdom
| | - Ben Nicolet
- Department of Hematopoiesis, Sanquin Research, Landsteiner Laboratory, Amsterdam UMC, University of Amsterdam, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, 3521 AL Utrecht, The Netherlands
| | - Chi-Lam Poon
- Graduate School of Medical Sciences, Weill Cornell Medicine, New York, New York 10065, USA
| | - Gregor Rot
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Leonard Schärfen
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Pin-Jou Wu
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, 72076 Tübingen, Germany
| | - Yoseop Yoon
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California Irvine, Irvine, California 92617, USA
| | - Yoseph Barash
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Mihaela Zavolan
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
20
|
Zukher I, Dujardin G, Sousa-Luís R, Proudfoot NJ. Elongation roadblocks mediated by dCas9 across human genes modulate transcription and nascent RNA processing. Nat Struct Mol Biol 2023; 30:1536-1548. [PMID: 37783853 PMCID: PMC10584677 DOI: 10.1038/s41594-023-01090-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 08/04/2023] [Indexed: 10/04/2023]
Abstract
Non-cleaving Cas9 (dCas9) is widely employed to manipulate specific gene loci, often with scant regard for unintended transcriptional effects. We demonstrate here that dCas9 mediates precise RNA polymerase II transcriptional pausing followed by transcription termination and potential alternative polyadenylation. By contrast, alternative splicing is unaffected, likely requiring more sustained alteration to elongation speed. The effect on transcription is orientation specific, with pausing only being induced when dCas9-associated guide RNA anneals to the non-template strand. Targeting the template strand induces minimal effects on transcription elongation and thus provides a neutral approach to recruit dCas9-linked effector domains to specific gene regions. In essence, we evaluate molecular effects of targeting dCas9 to mammalian transcription units. In so doing, we also provide new information on elongation by RNA polymerase II and coupled pre-mRNA processing.
Collapse
Affiliation(s)
- Inna Zukher
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK.
| | - Gwendal Dujardin
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Rui Sousa-Luís
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
| | - Nick J Proudfoot
- Sir William Dunn School of Pathology, University of Oxford, Oxford, UK.
| |
Collapse
|
21
|
Gunter HM, Idrisoglu S, Singh S, Han DJ, Ariens E, Peters JR, Wong T, Cheetham SW, Xu J, Rai SK, Feldman R, Herbert A, Marcellin E, Tropee R, Munro T, Mercer TR. mRNA vaccine quality analysis using RNA sequencing. Nat Commun 2023; 14:5663. [PMID: 37735471 PMCID: PMC10514319 DOI: 10.1038/s41467-023-41354-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 08/24/2023] [Indexed: 09/23/2023] Open
Abstract
The success of mRNA vaccines has been realised, in part, by advances in manufacturing that enabled billions of doses to be produced at sufficient quality and safety. However, mRNA vaccines must be rigorously analysed to measure their integrity and detect contaminants that reduce their effectiveness and induce side-effects. Currently, mRNA vaccines and therapies are analysed using a range of time-consuming and costly methods. Here we describe a streamlined method to analyse mRNA vaccines and therapies using long-read nanopore sequencing. Compared to other industry-standard techniques, VAX-seq can comprehensively measure key mRNA vaccine quality attributes, including sequence, length, integrity, and purity. We also show how direct RNA sequencing can analyse mRNA chemistry, including the detection of nucleoside modifications. To support this approach, we provide supporting software to automatically report on mRNA and plasmid template quality and integrity. Given these advantages, we anticipate that RNA sequencing methods, such as VAX-seq, will become central to the development and manufacture of mRNA drugs.
Collapse
Affiliation(s)
- Helen M Gunter
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Senel Idrisoglu
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Swati Singh
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Dae Jong Han
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Emily Ariens
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | | | - Ted Wong
- Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Seth W Cheetham
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Jun Xu
- Genome Innovation Hub, University of Queensland, Brisbane, QLD, Australia
| | - Subash Kumar Rai
- Genome Innovation Hub, University of Queensland, Brisbane, QLD, Australia
| | - Robert Feldman
- COVID19 Vaccine Corporation Limited (CVC), Auckland, New Zealand
| | - Andy Herbert
- COVID19 Vaccine Corporation Limited (CVC), Auckland, New Zealand
| | - Esteban Marcellin
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
| | - Romain Tropee
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Trent Munro
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia
- BASE facility, University of Queensland, Brisbane, QLD, Australia
| | - Tim R Mercer
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, QLD, Australia.
- BASE facility, University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
22
|
Aygün N, Krupa O, Mory J, Le B, Valone J, Liang D, Love MI, Stein JL. Genetics of cell-type-specific post-transcriptional gene regulation during human neurogenesis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.30.555019. [PMID: 37693528 PMCID: PMC10491258 DOI: 10.1101/2023.08.30.555019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
The function of some genetic variants associated with brain-relevant traits has been explained through colocalization with expression quantitative trait loci (eQTL) conducted in bulk post-mortem adult brain tissue. However, many brain-trait associated loci have unknown cellular or molecular function. These genetic variants may exert context-specific function on different molecular phenotypes including post-transcriptional changes. Here, we identified genetic regulation of RNA-editing and alternative polyadenylation (APA), within a cell-type-specific population of human neural progenitors and neurons. More RNA-editing and isoforms utilizing longer polyadenylation sequences were observed in neurons, likely due to higher expression of genes encoding the proteins mediating these post-transcriptional events. We also detected hundreds of cell-type-specific editing quantitative trait loci (edQTLs) and alternative polyadenylation QTLs (apaQTLs). We found colocalizations of a neuron edQTL in CCDC88A with educational attainment and a progenitor apaQTL in EP300 with schizophrenia, suggesting genetically mediated post-transcriptional regulation during brain development lead to differences in brain function.
Collapse
Affiliation(s)
- Nil Aygün
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Oleh Krupa
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jessica Mory
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Brandon Le
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jordan Valone
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Dan Liang
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Michael I. Love
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jason L. Stein
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Lead contact
| |
Collapse
|
23
|
Cui Y, Wang L, Ding Q, Shin J, Cassel J, Liu Q, Salvino JM, Tian B. Elevated pre-mRNA 3' end processing activity in cancer cells renders vulnerability to inhibition of cleavage and polyadenylation. Nat Commun 2023; 14:4480. [PMID: 37528120 PMCID: PMC10394034 DOI: 10.1038/s41467-023-39793-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/27/2023] [Indexed: 08/03/2023] Open
Abstract
Cleavage and polyadenylation (CPA) is responsible for 3' end processing of eukaryotic poly(A)+ RNAs and preludes transcriptional termination. JTE-607, which targets CPSF-73, is the first known CPA inhibitor (CPAi) in mammalian cells. Here we show that JTE-607 perturbs gene expression through both transcriptional readthrough and alternative polyadenylation (APA). Sensitive genes are associated with features similar to those previously identified for PCF11 knockdown, underscoring a unified transcriptomic signature of CPAi. The degree of inhibition of an APA site by JTE-607 correlates with its usage level and, consistently, cells with elevated CPA activities, such as those with induced overexpression of FIP1, display greater transcriptomic disturbances when treated with JTE-607. Moreover, JTE-607 causes S phase crisis and is hence synergistic with inhibitors of DNA damage repair pathways. Together, our data reveal CPA activity and proliferation rate as determinants of CPAi-mediated cell death, raising the possibility of using CPAi as an adjunct therapy to suppress certain cancers.
Collapse
Affiliation(s)
- Yange Cui
- Gene Expression and Regulation Program, and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, 19104, USA
| | - Luyang Wang
- Gene Expression and Regulation Program, and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, 19104, USA
| | - Qingbao Ding
- Gene Expression and Regulation Program, and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, 19104, USA
| | - Jihae Shin
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, 07103, USA
| | - Joel Cassel
- Molecular and Cellular Oncogenesis Program, The Wistar Institute, Philadelphia, PA, 19104, USA
| | - Qin Liu
- Molecular and Cellular Oncogenesis Program, The Wistar Institute, Philadelphia, PA, 19104, USA
| | - Joseph M Salvino
- Molecular and Cellular Oncogenesis Program, The Wistar Institute, Philadelphia, PA, 19104, USA
| | - Bin Tian
- Gene Expression and Regulation Program, and Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, 19104, USA.
| |
Collapse
|
24
|
Verwilt J, Mestdagh P, Vandesompele J. Artifacts and biases of the reverse transcription reaction in RNA sequencing. RNA (NEW YORK, N.Y.) 2023; 29:889-897. [PMID: 36990512 DOI: 10.1261/rna.079623.123] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
RNA sequencing has spurred a significant number of research areas in recent years. Most protocols rely on synthesizing a more stable complementary DNA (cDNA) copy of the RNA molecule during the reverse transcription reaction. The resulting cDNA pool is often wrongfully assumed to be quantitatively and molecularly similar to the original RNA input. Sadly, biases and artifacts confound the resulting cDNA mixture. These issues are often overlooked or ignored in the literature by those that rely on the reverse transcription process. In this review, we confront the reader with intra- and intersample biases and artifacts caused by the reverse transcription reaction during RNA sequencing experiments. To fight the reader's despair, we also provide solutions to most issues and inform on good RNA sequencing practices. We hope the reader can use this review to their advantage, thereby contributing to scientifically sound RNA studies.
Collapse
Affiliation(s)
- Jasper Verwilt
- OncoRNALab, Cancer Research Institute Ghent, 9000 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Center for Medical Genetics, Ghent University, 9000 Ghent, Belgium
| | - Pieter Mestdagh
- OncoRNALab, Cancer Research Institute Ghent, 9000 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Center for Medical Genetics, Ghent University, 9000 Ghent, Belgium
| | - Jo Vandesompele
- OncoRNALab, Cancer Research Institute Ghent, 9000 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Center for Medical Genetics, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
25
|
Bryce-Smith S, Burri D, Gazzara MR, Herrmann CJ, Danecka W, Fitzsimmons CM, Wan YK, Zhuang F, Fansler MM, Fernández JM, Ferret M, Gonzalez-Uriarte A, Haynes S, Herdman C, Kanitz A, Katsantoni M, Marini F, McDonnel E, Nicolet B, Poon CL, Rot G, Schärfen L, Wu PJ, Yoon Y, Barash Y, Zavolan M. Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.23.546284. [PMID: 37425672 PMCID: PMC10327023 DOI: 10.1101/2023.06.23.546284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, and limitations and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3'-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for seamless extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies. Furthermore, the containers and reproducible workflows generated in the course of this project can be seamlessly deployed and extended in the future to evaluate new methods or datasets.
Collapse
Affiliation(s)
- Sam Bryce-Smith
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Dominik Burri
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Matthew R. Gazzara
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
| | - Christina J. Herrmann
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Weronika Danecka
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
| | - Christina M. Fitzsimmons
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Yuk Kei Wan
- Genome Institute of Singapore, Buona Vista, Singapore
- National University of Singapore, Kent Ridge, Singapore
| | - Farica Zhuang
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, USA
| | - Mervin M. Fansler
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell GraduateStudies, New York, NY, USA
- Cancer Biology and Genetics, Sloan-Kettering Institute, MSKCC, New York, NY, USA
| | - José M. Fernández
- Barcelona Supercomputing Center, Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES)
| | - Meritxell Ferret
- Barcelona Supercomputing Center, Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES)
| | - Asier Gonzalez-Uriarte
- Barcelona Supercomputing Center, Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES)
| | - Samuel Haynes
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
| | | | - Alexander Kanitz
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Maria Katsantoni
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI) - UniversityMedical Center of the Johannes Gutenberg, University Mainz, Germany
| | - Euan McDonnel
- Leeds Institute for Data Analytics, School of Molecular and Cellular Biology, University of Leeds, United Kingdom
| | - Ben Nicolet
- Department of Hematopoiesis, Sanquin Research, Landsteiner Laboratory, AmsterdamUMC, University of Amsterdam, and Oncode Institute, Amsterdam, The Netherlands
| | | | - Gregor Rot
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute of Molecular Life Sciences, Zurich, Switzerland
| | - Leonard Schärfen
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven CT, USA
| | - Pin-Jou Wu
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, Germany
| | - Yoseop Yoon
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California Irvine, Irvine, California, USA
| | - Yoseph Barash
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, USA
| | - Mihaela Zavolan
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
26
|
Cao J, Kuyumcu-Martinez MN. Alternative polyadenylation regulation in cardiac development and cardiovascular disease. Cardiovasc Res 2023; 119:1324-1335. [PMID: 36657944 PMCID: PMC10262186 DOI: 10.1093/cvr/cvad014] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 11/01/2022] [Accepted: 11/28/2022] [Indexed: 01/21/2023] Open
Abstract
Cleavage and polyadenylation of pre-mRNAs is a necessary step for gene expression and function. Majority of human genes exhibit multiple polyadenylation sites, which can be alternatively used to generate different mRNA isoforms from a single gene. Alternative polyadenylation (APA) of pre-mRNAs is important for the proteome and transcriptome landscape. APA is tightly regulated during development and contributes to tissue-specific gene regulation. Mis-regulation of APA is linked to a wide range of pathological conditions. APA-mediated gene regulation in the heart is emerging as a new area of research. Here, we will discuss the impact of APA on gene regulation during heart development and in cardiovascular diseases. First, we will briefly review how APA impacts gene regulation and discuss molecular mechanisms that control APA. Then, we will address APA regulation during heart development and its dysregulation in cardiovascular diseases. Finally, we will discuss pre-mRNA targeting strategies to correct aberrant APA patterns of essential genes for the treatment or prevention of cardiovascular diseases. The RNA field is blooming due to advancements in RNA-based technologies. RNA-based vaccines and therapies are becoming the new line of effective and safe approaches for the treatment and prevention of human diseases. Overall, this review will be influential for understanding gene regulation at the RNA level via APA in the heart and will help design RNA-based tools for the treatment of cardiovascular diseases in the future.
Collapse
Affiliation(s)
- Jun Cao
- Faculty of Environment and Life, Beijing University of Technology, Xueyuan Road, Haidian District, Beijing 100124, PR China
| | - Muge N Kuyumcu-Martinez
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch, 301 University Blvd, Galveston, TX 77573, USA
- Department of Neurobiology, University of Texas Medical Branch, Galveston, TX 77555, USA
- Institute for Translational Sciences, University of Texas Medical Branch, 301 University Blvd, Galveston, TX 77573, USA
| |
Collapse
|
27
|
Vlasenok M, Margasyuk S, Pervouchine DD. Transcriptome sequencing suggests that pre-mRNA splicing counteracts widespread intronic cleavage and polyadenylation. NAR Genom Bioinform 2023; 5:lqad051. [PMID: 37260513 PMCID: PMC10227441 DOI: 10.1093/nargab/lqad051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 05/09/2023] [Accepted: 05/17/2023] [Indexed: 06/02/2023] Open
Abstract
Alternative splicing (AS) and alternative polyadenylation (APA) are two crucial steps in the post-transcriptional regulation of eukaryotic gene expression. Protocols capturing and sequencing RNA 3'-ends have uncovered widespread intronic polyadenylation (IPA) in normal and disease conditions, where it is currently attributed to stochastic variations in the pre-mRNA processing. Here, we took advantage of the massive amount of RNA-seq data generated by the Genotype Tissue Expression project (GTEx) to simultaneously identify and match tissue-specific expression of intronic polyadenylation sites with tissue-specific splicing. A combination of computational methods including the analysis of short reads with non-templated adenines revealed that APA events are more abundant in introns than in exons. While the rate of IPA in composite terminal exons and skipped terminal exons expectedly correlates with splicing, we observed a considerable fraction of IPA events that lack AS support and attributed them to spliced polyadenylated introns (SPI). We hypothesize that SPIs represent transient byproducts of a dynamic coupling between APA and AS, in which the spliceosome removes the intron while it is being cleaved and polyadenylated. These findings indicate that cotranscriptional pre-mRNA splicing could serve as a rescue mechanism to suppress premature transcription termination at intronic polyadenylation sites.
Collapse
Affiliation(s)
- Maria Vlasenok
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar 30, Moscow 121205, Russia
| | - Sergey Margasyuk
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar 30, Moscow 121205, Russia
| | - Dmitri D Pervouchine
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar 30, Moscow 121205, Russia
| |
Collapse
|
28
|
de Morree A, Rando TA. Regulation of adult stem cell quiescence and its functions in the maintenance of tissue integrity. Nat Rev Mol Cell Biol 2023; 24:334-354. [PMID: 36922629 PMCID: PMC10725182 DOI: 10.1038/s41580-022-00568-6] [Citation(s) in RCA: 55] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/29/2022] [Indexed: 03/18/2023]
Abstract
Adult stem cells are important for mammalian tissues, where they act as a cell reserve that supports normal tissue turnover and can mount a regenerative response following acute injuries. Quiescent stem cells are well established in certain tissues, such as skeletal muscle, brain, and bone marrow. The quiescent state is actively controlled and is essential for long-term maintenance of stem cell pools. In this Review, we discuss the importance of maintaining a functional pool of quiescent adult stem cells, including haematopoietic stem cells, skeletal muscle stem cells, neural stem cells, hair follicle stem cells, and mesenchymal stem cells such as fibro-adipogenic progenitors, to ensure tissue maintenance and repair. We discuss the molecular mechanisms that regulate the entry into, maintenance of, and exit from the quiescent state in mice. Recent studies revealed that quiescent stem cells have a discordance between RNA and protein levels, indicating the importance of post-transcriptional mechanisms, such as alternative polyadenylation, alternative splicing, and translation repression, in the control of stem cell quiescence. Understanding how these mechanisms guide stem cell function during homeostasis and regeneration has important implications for regenerative medicine.
Collapse
Affiliation(s)
- Antoine de Morree
- Department of Neurology and Neurological Science, Stanford University School of Medicine, Stanford, CA, USA.
- Paul F. Glenn Center for the Biology of Aging, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Biomedicine, Aarhus University, Aarhus, Denmark.
| | - Thomas A Rando
- Department of Neurology and Neurological Science, Stanford University School of Medicine, Stanford, CA, USA.
- Paul F. Glenn Center for the Biology of Aging, Stanford University School of Medicine, Stanford, CA, USA.
- Center for Tissue Regeneration, Repair, and Restoration, Veterans Affairs Palo Alto Health Care System, Palo Alto, CA, USA.
- Broad Stem Cell Research Center, University of California, Los Angeles, Los Angeles, CA, USA.
| |
Collapse
|
29
|
LaForce GR, Philippidou P, Schaffer AE. mRNA isoform balance in neuronal development and disease. WILEY INTERDISCIPLINARY REVIEWS. RNA 2023; 14:e1762. [PMID: 36123820 PMCID: PMC10024649 DOI: 10.1002/wrna.1762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 07/11/2022] [Accepted: 08/15/2022] [Indexed: 11/07/2022]
Abstract
Balanced mRNA isoform diversity and abundance are spatially and temporally regulated throughout cellular differentiation. The proportion of expressed isoforms contributes to cell type specification and determines key properties of the differentiated cells. Neurons are unique cell types with intricate developmental programs, characteristic cellular morphologies, and electrophysiological potential. Neuron-specific gene expression programs establish these distinctive cellular characteristics and drive diversity among neuronal subtypes. Genes with neuron-specific alternative processing are enriched in key neuronal functions, including synaptic proteins, adhesion molecules, and scaffold proteins. Despite the similarity of neuronal gene expression programs, each neuronal subclass can be distinguished by unique alternative mRNA processing events. Alternative processing of developmentally important transcripts alters coding and regulatory information, including interaction domains, transcript stability, subcellular localization, and targeting by RNA binding proteins. Fine-tuning of mRNA processing is essential for neuronal activity and maintenance. Thus, the focus of neuronal RNA biology research is to dissect the transcriptomic mechanisms that underlie neuronal homeostasis, and consequently, predispose neuronal subtypes to disease. This article is categorized under: RNA in Disease and Development > RNA in Disease RNA in Disease and Development > RNA in Development.
Collapse
Affiliation(s)
- Geneva R LaForce
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Polyxeni Philippidou
- Department of Neurosciences, Case Western Reserve University, Cleveland, Ohio, USA
| | - Ashleigh E Schaffer
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| |
Collapse
|
30
|
Cui Y, Arnold FJ, Peng F, Wang D, Li JS, Michels S, Wagner EJ, La Spada AR, Li W. Alternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders. Nat Commun 2023; 14:583. [PMID: 36737438 PMCID: PMC9898543 DOI: 10.1038/s41467-023-36311-8] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 01/25/2023] [Indexed: 02/05/2023] Open
Abstract
Alternative polyadenylation (APA) plays an essential role in brain development; however, current transcriptome-wide association studies (TWAS) largely overlook APA in nominating susceptibility genes. Here, we performed a 3' untranslated region (3'UTR) APA TWAS (3'aTWAS) for 11 brain disorders by combining their genome-wide association studies data with 17,300 RNA-seq samples across 2,937 individuals. We identified 354 3'aTWAS-significant genes, including known APA-linked risk genes, such as SNCA in Parkinson's disease. Among these 354 genes, ~57% are not significant in traditional expression- and splicing-TWAS studies, since APA may regulate the translation, localization and protein-protein interaction of the target genes independent of mRNA level expression or splicing. Furthermore, we discovered ATXN3 as a 3'aTWAS-significant gene for amyotrophic lateral sclerosis, and its modulation substantially impacted pathological hallmarks of amyotrophic lateral sclerosis in vitro. Together, 3'aTWAS is a powerful strategy to nominate important APA-linked brain disorder susceptibility genes, most of which are largely overlooked by conventional expression and splicing analyses.
Collapse
Affiliation(s)
- Ya Cui
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA
| | - Frederick J Arnold
- Departments of Pathology & Laboratory Medicine, Neurology, and Biological Chemistry, School of Medicine, and the UCI Institute for Neurotherapeutics, University of California Irvine, Irvine, CA, 92697, USA
| | - Fanglue Peng
- Department of Molecular and Cellular Biology, University Baylor College of Medicine, Houston, TX, 77030, USA
| | - Dan Wang
- Department of Medicine, Division of Cardiology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Jason Sheng Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA
| | - Sebastian Michels
- Departments of Pathology & Laboratory Medicine, Neurology, and Biological Chemistry, School of Medicine, and the UCI Institute for Neurotherapeutics, University of California Irvine, Irvine, CA, 92697, USA
| | - Eric J Wagner
- School of Medicine and Dentistry, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Albert R La Spada
- Departments of Pathology & Laboratory Medicine, Neurology, and Biological Chemistry, School of Medicine, and the UCI Institute for Neurotherapeutics, University of California Irvine, Irvine, CA, 92697, USA.
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA.
| |
Collapse
|
31
|
Xiao S, Gu H, Deng L, Yang X, Qiao D, Zhang X, Zhang T, Yu T. Relationship between NUDT21 mediated alternative polyadenylation process and tumor. Front Oncol 2023; 13:1052012. [PMID: 36816917 PMCID: PMC9933127 DOI: 10.3389/fonc.2023.1052012] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Accepted: 01/11/2023] [Indexed: 02/05/2023] Open
Abstract
Alternative polyadenylation (APA) is a molecular process that generates diversity at the 3' end of RNA polymerase II transcripts from over 60% of human genes. APA and microRNA regulation are both mechanisms of post-transcriptional regulation of gene expression. As a key molecular mechanism, Alternative polyadenylation often results in mRNA isoforms with the same coding sequence but different lengths of 3' UTRs, while microRNAs regulate gene expression by binding to specific mRNA 3' UTRs. Nudix Hydrolase 21 (NUDT21) is a crucial mediator involved in alternative polyadenylation (APA). Different studies have reported a dual role of NUDT21 in cancer (both oncogenic and tumor suppressor). The present review focuses on the functions of APA, miRNA and their interaction and roles in development of different types of tumors.NUDT21 mediated 3' UTR-APA changes can be used to generate specific signatures that can be used as potential biomarkers in development and disease. Due to the emerging role of NUDT21 as a regulator of the aforementioned RNA processing events, modulation of NUDT21 levels may be a novel viable therapeutic approach.
Collapse
Affiliation(s)
- Shan Xiao
- Department of Oncology, Affiliated Hospital of Southwest Medical University of China, Luzhou, China
| | - Huan Gu
- Department of Head and Neck Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Li Deng
- Department of Oncology, Affiliated Hospital of Southwest Medical University of China, Luzhou, China
| | - Xiongtao Yang
- Department of Head and Neck Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Dan Qiao
- Department of Head and Neck Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Xudong Zhang
- Department of Anesthesia, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
| | - Tian Zhang
- Department of Head and Neck Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China,*Correspondence: Tao Yu, ; Tian Zhang,
| | - Tao Yu
- Department of Oncology, Affiliated Hospital of Southwest Medical University of China, Luzhou, China,Department of Head and Neck Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China,*Correspondence: Tao Yu, ; Tian Zhang,
| |
Collapse
|
32
|
Jonnakuti VS, Wagner EJ, Maletić-Savatić M, Liu Z, Yalamanchili HK. PolyAMiner-Bulk: A Machine Learning Based Bioinformatics Algorithm to Infer and Decode Alternative Polyadenylation Dynamics from bulk RNA-seq data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.23.523471. [PMID: 36747700 PMCID: PMC9900750 DOI: 10.1101/2023.01.23.523471] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
More than half of human genes exercise alternative polyadenylation (APA) and generate mRNA transcripts with varying 3' untranslated regions (UTR). However, current computational approaches for identifying cleavage and polyadenylation sites (C/PASs) and quantifying 3'UTR length changes from bulk RNA-seq data fail to unravel tissue- and disease-specific APA dynamics. Here, we developed a next-generation bioinformatics algorithm and application, PolyAMiner-Bulk, that utilizes an attention-based machine learning architecture and an improved vector projection-based engine to infer differential APA dynamics accurately. When applied to earlier studies, PolyAMiner-Bulk accurately identified more than twice the number of APA changes in an RBM17 knockdown bulk RNA-seq dataset compared to current generation tools. Moreover, on a separate dataset, PolyAMiner-Bulk revealed novel APA dynamics and pathways in scleroderma pathology and identified differential APA in a gene that was identified as being involved in scleroderma pathogenesis in an independent study. Lastly, we used PolyAMiner-Bulk to analyze the RNA-seq data of post-mortem prefrontal cortexes from the ROSMAP data consortium and unraveled novel APA dynamics in Alzheimer's Disease. Our method, PolyAMiner-Bulk, creates a paradigm for future alternative polyadenylation analysis from bulk RNA-seq data.
Collapse
Affiliation(s)
- Venkata Soumith Jonnakuti
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA
- Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX, 77030, USA
- Program in Quantitative and Computational Biology, Baylor College of Medicine, Houston, TX, 77030, USA
- Medical Scientist Training Program, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Eric J. Wagner
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Mirjana Maletić-Savatić
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA
- Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX, 77030, USA
| | - Zhandong Liu
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA
- Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX, 77030, USA
- Program in Quantitative and Computational Biology, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Hari Krishna Yalamanchili
- Department of Pediatrics, Baylor College of Medicine, Houston, TX, 77030, USA
- Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX, 77030, USA
| |
Collapse
|
33
|
He Y, Wu N. Alternative Polyadenylation Results in mRNA Transcript Instability in Gestational Diabetes Mellitus. Diabetes Metab Syndr Obes 2023; 16:619-628. [PMID: 36915397 PMCID: PMC10008025 DOI: 10.2147/dmso.s400283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Accepted: 03/02/2023] [Indexed: 03/09/2023] Open
Abstract
OBJECTIVE To study the characteristics of selective polyadenylation (APA) in gestational diabetes mellitus (GDM) by poly(A) site sequencing and to explore the role of APA process in the pathogenesis of GDM. METHODS Three pregnant women diagnosed as GDM in our hospital were randomly selected as the GDM group, and three healthy pregnant women at the same time as the control group. The placental tissues of two groups of pregnant women after delivery were collected for high-throughput transcriptome sequencing (RNA-seq) and poly(A) site sequencing (PAS-seq) to screen differentially expressed genes and variable 3'UTR genes in GDM. Gene Ontology (GO) analysis and pathway analysis were used to analyze the functional classification and pathway of differential genes, and preliminarily explore the susceptible genes in GDM. RESULTS Compared with the control group, there were 202 TTS loci in the GDM group, including 103 genes with shortened TTS loci and 99 genes with delayed TTS loci. There were 57 genes with significant difference in TTS (P<0.05). Subsequently, we found that VCPIP1 and LGR4 were differentially expressed in RNA-seq. The genes in advance of TTS locus were enriched in biological processes such as cell development, protein transport and phosphorylation, signal transduction, etc. Delayed TTS genes are enriched in biological processes such as transcriptional regulation, cell migration and cycle, DNA repair and damage. CONCLUSION The abnormality of APA process may be involved in the occurrence and development of GDM. The genes with significantly different changes in TTS locus may become biomarkers or predictors for GDM to assess the incidence, disease progression and disease severity, and may also become potential targets for GDM treatment.
Collapse
Affiliation(s)
- Yujing He
- Department of Endocrinology, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
| | - Na Wu
- Department of Endocrinology, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
- Department of Medical Service Quality, Shengjing Hospital of China Medical University, Shenyang, People’s Republic of China
- Correspondence: Na Wu, Email
| |
Collapse
|
34
|
Gallicchio L, Olivares GH, Berry CW, Fuller MT. Regulation and function of alternative polyadenylation in development and differentiation. RNA Biol 2023; 20:908-925. [PMID: 37906624 PMCID: PMC10730144 DOI: 10.1080/15476286.2023.2275109] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/17/2023] [Indexed: 11/02/2023] Open
Abstract
Alternative processing of nascent mRNAs is widespread in eukaryotic organisms and greatly impacts the output of gene expression. Specifically, alternative cleavage and polyadenylation (APA) is a co-transcriptional molecular process that switches the polyadenylation site (PAS) at which a nascent mRNA is cleaved, resulting in mRNA isoforms with different 3'UTR length and content. APA can potentially affect mRNA translation efficiency, localization, stability, and mRNA seeded protein-protein interactions. APA naturally occurs during development and cellular differentiation, with around 70% of human genes displaying APA in particular tissues and cell types. For example, neurons tend to express mRNAs with long 3'UTRs due to preferential processing at PASs more distal than other PASs used in other cell types. In addition, changes in APA mark a variety of pathological states, including many types of cancer, in which mRNAs are preferentially cleaved at more proximal PASs, causing expression of mRNA isoforms with short 3'UTRs. Although APA has been widely reported, both the function of APA in development and the mechanisms that regulate the choice of 3'end cut sites in normal and pathogenic conditions are still poorly understood. In this review, we summarize current understanding of how APA is regulated during development and cellular differentiation and how the resulting change in 3'UTR content affects multiple aspects of gene expression. With APA being a widespread phenomenon, the advent of cutting-edge scientific techniques and the pressing need for in-vivo studies, there has never been a better time to delve into the intricate mechanisms of alternative cleavage and polyadenylation.
Collapse
Affiliation(s)
- Lorenzo Gallicchio
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, USA
| | - Gonzalo H. Olivares
- Escuela de Kinesiología, Facultad de Medicina y Ciencias de la Salud, Center for Integrative Biology (CIB), Universidad Mayor, Chile and Department of Neuroscience, Faculty of Medicine, Universidad de Chile, Santiago, Chile
| | | | - Margaret T. Fuller
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, USA
- Department of Genetics, Stanford University School of Medicine, Stanford, USA
| |
Collapse
|
35
|
Begik O, Diensthuber G, Liu H, Delgado-Tejedor A, Kontur C, Niazi AM, Valen E, Giraldez AJ, Beaudoin JD, Mattick JS, Novoa EM. Nano3P-seq: transcriptome-wide analysis of gene expression and tail dynamics using end-capture nanopore cDNA sequencing. Nat Methods 2023; 20:75-85. [PMID: 36536091 PMCID: PMC9834059 DOI: 10.1038/s41592-022-01714-w] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 11/03/2022] [Indexed: 12/24/2022]
Abstract
RNA polyadenylation plays a central role in RNA maturation, fate, and stability. In response to developmental cues, polyA tail lengths can vary, affecting the translation efficiency and stability of mRNAs. Here we develop Nanopore 3' end-capture sequencing (Nano3P-seq), a method that relies on nanopore cDNA sequencing to simultaneously quantify RNA abundance, tail composition, and tail length dynamics at per-read resolution. By employing a template-switching-based sequencing protocol, Nano3P-seq can sequence RNA molecule from its 3' end, regardless of its polyadenylation status, without the need for PCR amplification or ligation of RNA adapters. We demonstrate that Nano3P-seq provides quantitative estimates of RNA abundance and tail lengths, and captures a wide diversity of RNA biotypes. We find that, in addition to mRNA and long non-coding RNA, polyA tails can be identified in 16S mitochondrial ribosomal RNA in both mouse and zebrafish models. Moreover, we show that mRNA tail lengths are dynamically regulated during vertebrate embryogenesis at an isoform-specific level, correlating with mRNA decay. Finally, we demonstrate the ability of Nano3P-seq in capturing non-A bases within polyA tails of various lengths, and reveal their distribution during vertebrate embryogenesis. Overall, Nano3P-seq is a simple and robust method for accurately estimating transcript levels, tail lengths, and tail composition heterogeneity in individual reads, with minimal library preparation biases, both in the coding and non-coding transcriptome.
Collapse
Affiliation(s)
- Oguzhan Begik
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Faculty of Medicine, University of New South Wales, Sydney, New South Wales, Australia
| | - Gregor Diensthuber
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Huanle Liu
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Anna Delgado-Tejedor
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
- Universitat Pompeu Fabra, Barcelona, Spain
| | | | - Adnan Muhammad Niazi
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - Eivind Valen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
| | | | - Jean-Denis Beaudoin
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, USA
| | - John S Mattick
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Eva Maria Novoa
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain.
- Universitat Pompeu Fabra, Barcelona, Spain.
| |
Collapse
|
36
|
Mitschka S, Mayr C. Context-specific regulation and function of mRNA alternative polyadenylation. Nat Rev Mol Cell Biol 2022; 23:779-796. [PMID: 35798852 PMCID: PMC9261900 DOI: 10.1038/s41580-022-00507-5] [Citation(s) in RCA: 134] [Impact Index Per Article: 44.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 02/08/2023]
Abstract
Alternative cleavage and polyadenylation (APA) is a widespread mechanism to generate mRNA isoforms with alternative 3' untranslated regions (UTRs). The expression of alternative 3' UTR isoforms is highly cell type specific and is further controlled in a gene-specific manner by environmental cues. In this Review, we discuss how the dynamic, fine-grained regulation of APA is accomplished by several mechanisms, including cis-regulatory elements in RNA and DNA and factors that control transcription, pre-mRNA cleavage and post-transcriptional processes. Furthermore, signalling pathways modulate the activity of these factors and integrate APA into gene regulatory programmes. Dysregulation of APA can reprogramme the outcome of signalling pathways and thus can control cellular responses to environmental changes. In addition to the regulation of protein abundance, APA has emerged as a major regulator of mRNA localization and the spatial organization of protein synthesis. This role enables the regulation of protein function through the addition of post-translational modifications or the formation of protein-protein interactions. We further discuss recent transformative advances in single-cell RNA sequencing and CRISPR-Cas technologies, which enable the mapping and functional characterization of alternative 3' UTRs in any biological context. Finally, we discuss new APA-based RNA therapeutics, including compounds that target APA in cancer and therapeutic genome editing of degenerative diseases.
Collapse
Affiliation(s)
- Sibylle Mitschka
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Christine Mayr
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| |
Collapse
|
37
|
Qi Y, Wang M, Jiang Q. PABPC1--mRNA stability, protein translation and tumorigenesis. Front Oncol 2022; 12:1025291. [PMID: 36531055 PMCID: PMC9753129 DOI: 10.3389/fonc.2022.1025291] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 11/08/2022] [Indexed: 09/29/2023] Open
Abstract
Mammalian poly A-binding proteins (PABPs) are highly conserved multifunctional RNA-binding proteins primarily involved in the regulation of mRNA translation and stability, of which PABPC1 is considered a central regulator of cytoplasmic mRNA homing and is involved in a wide range of physiological and pathological processes by regulating almost every aspect of RNA metabolism. Alterations in its expression and function disrupt intra-tissue homeostasis and contribute to the development of various tumors. There is increasing evidence that PABPC1 is aberrantly expressed in a variety of tumor tissues and cancers such as lung, gastric, breast, liver, and esophageal cancers, and PABPC1 might be used as a potential biomarker for tumor diagnosis, treatment, and clinical application in the future. In this paper, we review the abnormal expression, functional role, and molecular mechanism of PABPC1 in tumorigenesis and provide directions for further understanding the regulatory role of PABPC1 in tumor cells.
Collapse
Affiliation(s)
- Ya Qi
- Department of Gynecology and Obstetrics, Shengjing Hospital Affiliated of China Medical University, Shenyang, Liaoning, China
| | - Min Wang
- Department of Gynecology and Obstetrics, Shengjing Hospital Affiliated of China Medical University, Shenyang, Liaoning, China
| | - Qi Jiang
- Second Department of Clinical Medicine, China Medical University, Shenyang, Liaoning, China
| |
Collapse
|
38
|
Fahmi NA, Ahmed KT, Chang JW, Nassereddeen H, Fan D, Yong J, Zhang W. APA-Scan: detection and visualization of 3'-UTR alternative polyadenylation with RNA-seq and 3'-end-seq data. BMC Bioinformatics 2022; 23:396. [PMID: 36171568 PMCID: PMC9520800 DOI: 10.1186/s12859-022-04939-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 09/16/2022] [Indexed: 11/26/2022] Open
Abstract
Background The eukaryotic genome is capable of producing multiple isoforms from a gene by alternative polyadenylation (APA) during pre-mRNA processing. APA in the 3′-untranslated region (3′-UTR) of mRNA produces transcripts with shorter or longer 3′-UTR. Often, 3′-UTR serves as a binding platform for microRNAs and RNA-binding proteins, which affect the fate of the mRNA transcript. Thus, 3′-UTR APA is known to modulate translation and provides a mean to regulate gene expression at the post-transcriptional level. Current bioinformatics pipelines have limited capability in profiling 3′-UTR APA events due to incomplete annotations and a low-resolution analyzing power: widely available bioinformatics pipelines do not reference actionable polyadenylation (cleavage) sites but simulate 3′-UTR APA only using RNA-seq read coverage, causing false positive identifications. To overcome these limitations, we developed APA-Scan, a robust program that identifies 3′-UTR APA events and visualizes the RNA-seq short-read coverage with gene annotations.
Methods APA-Scan utilizes either predicted or experimentally validated actionable polyadenylation signals as a reference for polyadenylation sites and calculates the quantity of long and short 3′-UTR transcripts in the RNA-seq data. APA-Scan works in three major steps: (i) calculate the read coverage of the 3′-UTR regions of genes; (ii) identify the potential APA sites and evaluate the significance of the events among two biological conditions; (iii) graphical representation of user specific event with 3′-UTR annotation and read coverage on the 3′-UTR regions. APA-Scan is implemented in Python3. Source code and a comprehensive user’s manual are freely available at https://github.com/compbiolabucf/APA-Scan. Result APA-Scan was applied to both simulated and real RNA-seq datasets and compared with two widely used baselines DaPars and APAtrap. In simulation APA-Scan significantly improved the accuracy of 3′-UTR APA identification compared to the other baselines. The performance of APA-Scan was also validated by 3′-end-seq data and qPCR on mouse embryonic fibroblast cells. The experiments confirm that APA-Scan can detect unannotated 3′-UTR APA events and improve genome annotation. Conclusion APA-Scan is a comprehensive computational pipeline to detect transcriptome-wide 3′-UTR APA events. The pipeline integrates both RNA-seq and 3′-end-seq data information and can efficiently identify the significant events with a high-resolution short reads coverage plots. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04939-w.
Collapse
Affiliation(s)
- Naima Ahmed Fahmi
- Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL, 32816, USA
| | - Khandakar Tanvir Ahmed
- Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL, 32816, USA
| | - Jae-Woong Chang
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota Twin Cities, 420 Washington Ave. S.E., Minneapolis, MN, 55455, USA
| | - Heba Nassereddeen
- Department of Computer Engineering, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL, 32816, USA
| | - Deliang Fan
- School of Electrical, Computer and Energy Engineering, Arizona State University, 650 E Tyler Mall, Tempe, AZ, 85287, USA
| | - Jeongsik Yong
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota Twin Cities, 420 Washington Ave. S.E., Minneapolis, MN, 55455, USA.
| | - Wei Zhang
- Department of Computer Science, University of Central Florida, 4000 Central Florida Blvd, Orlando, FL, 32816, USA.
| |
Collapse
|
39
|
CYCLIN K down-regulation induces androgen receptor gene intronic polyadenylation, variant expression and PARP inhibitor vulnerability in castration-resistant prostate cancer. Proc Natl Acad Sci U S A 2022; 119:e2205509119. [PMID: 36129942 PMCID: PMC9522376 DOI: 10.1073/pnas.2205509119] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Expression of androgen receptor variants (AR-Vs) is implicated in the development of castration-resistant prostate cancer (PCa). Others have shown that androgen depletion or antiandrogen treatment induces AR-V expression in PCa cell lines, xenografts, and patient samples, although the underlying mechanism remains unclear. Our findings reveal that hormonal therapy–induced CYCLIN K down-regulation represents a key mechanism that drives intronic polyadenylation (IPA) usage in the AR gene and AR-V expression and castration resistance in PCa, and that this mechanism of action can be therapeutically targeted by the PARP inhibitor. Androgen receptor (AR) messenger RNA (mRNA) alternative splicing variants (AR-Vs) are implicated in castration-resistant progression of prostate cancer (PCa), although the molecular mechanism underlying the genesis of AR-Vs remains poorly understood. The CDK12 gene is often deleted or mutated in PCa and CDK12 deficiency is known to cause homologous recombination repair gene alteration or BRCAness via alternative polyadenylation (APA). Here, we demonstrate that pharmacological inhibition or genetic inactivation of CDK12 induces AR gene intronic (intron 3) polyadenylation (IPA) usage, AR-V expression, and PCa cell resistance to the antiandrogen enzalutamide (ENZ). We further show that AR binds to the CCNK gene promoter and up-regulates CYCLIN K expression. In contrast, ENZ decreases AR occupancy at the CCNK gene promoter and suppresses CYCLIN K expression. Similar to the effect of the CDK12 inhibitor, CYCLIN K degrader or ENZ treatment promotes AR gene IPA usage, AR-V expression, and ENZ-resistant growth of PCa cells. Importantly, we show that targeting BRCAness induced by CYCLIN K down-regulation with the PARP inhibitor overcomes ENZ resistance. Our findings identify CYCLIN K down-regulation as a key driver of IPA usage, hormonal therapy–induced AR-V expression, and castration resistance in PCa. These results suggest that hormonal therapy–induced AR-V expression and therapy resistance are vulnerable to PARP inhibitor treatment.
Collapse
|
40
|
Lee S, Chen YC, Gillen AE, Taliaferro JM, Deplancke B, Li H, Lai EC. Diverse cell-specific patterns of alternative polyadenylation in Drosophila. Nat Commun 2022; 13:5372. [PMID: 36100597 PMCID: PMC9470587 DOI: 10.1038/s41467-022-32305-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 07/24/2022] [Indexed: 11/17/2022] Open
Abstract
Most genes in higher eukaryotes express isoforms with distinct 3' untranslated regions (3' UTRs), generated by alternative polyadenylation (APA). Since 3' UTRs are predominant locations of post-transcriptional regulation, APA can render such programs conditional, and can also alter protein sequences via alternative last exon (ALE) isoforms. We previously used 3'-sequencing from diverse Drosophila samples to define multiple tissue-specific APA landscapes. Here, we exploit comprehensive single nucleus RNA-sequencing data (Fly Cell Atlas) to elucidate cell-type expression of 3' UTRs across >250 adult Drosophila cell types. We reveal the cellular bases of multiple tissue-specific APA/ALE programs, such as 3' UTR lengthening in differentiated neurons and 3' UTR shortening in spermatocytes and spermatids. We trace dynamic 3' UTR patterns across cell lineages, including in the male germline, and discover new APA patterns in the intestinal stem cell lineage. Finally, we correlate expression of RNA binding proteins (RBPs), miRNAs and global levels of cleavage and polyadenylation (CPA) factors in several cell types that exhibit characteristic APA landscapes, yielding candidate regulators of transcriptome complexity. These analyses provide a comprehensive foundation for future investigations of mechanisms and biological impacts of alternative 3' isoforms across the major cell types of this widely-studied model organism.
Collapse
Affiliation(s)
- Seungjae Lee
- Developmental Biology Program, Sloan Kettering Institute, 1275 York Ave, Box 252, New York, NY, 10065, USA
| | - Yen-Chung Chen
- Department of Biology, New York University, New York, NY, 10013, USA
| | | | - Austin E Gillen
- Division of Hematology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.,Rocky Mountain Regional VA Medical Center, Aurora, CO, USA.,RNA Bioscience Initiative, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - J Matthew Taliaferro
- RNA Bioscience Initiative, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.,Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bio-engineering & Global Health Institute, School of Life Sciences, EPFL, CH-1015, Lausanne, Switzerland
| | - Hongjie Li
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Eric C Lai
- Developmental Biology Program, Sloan Kettering Institute, 1275 York Ave, Box 252, New York, NY, 10065, USA.
| |
Collapse
|
41
|
NBBt-test: a versatile method for differential analysis of multiple types of RNA-seq data. Sci Rep 2022; 12:12833. [PMID: 35896555 PMCID: PMC9329447 DOI: 10.1038/s41598-022-15762-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 06/29/2022] [Indexed: 11/25/2022] Open
Abstract
Rapid development of transcriptome sequencing technologies has resulted in a data revolution and emergence of new approaches to study transcriptomic regulation such as alternative splicing, alternative polyadenylation, CRISPR knockout screening in addition to the regular gene expression. A full characterization of the transcriptional landscape of different groups of cells or tissues holds enormous potential for both basic science as well as clinical applications. Although many methods have been developed in the realm of differential gene expression analysis, they all geared towards a particular type of sequencing data and failed to perform well when applied in different types of transcriptomic data. To fill this gap, we offer a negative beta binomial t-test (NBBt-test). NBBt-test provides multiple functions to perform differential analyses of alternative splicing, polyadenylation, CRISPR knockout screening, and gene expression datasets. Both real and large-scale simulation data show superior performance of NBBt-test with higher efficiency, and lower type I error rate and FDR to identify differential isoforms and differentially expressed genes and differential CRISPR knockout screening genes with different sample sizes when compared against the current very popular statistical methods. An R-package implementing NBBt-test is available for downloading from CRAN (https://CRAN.R-project.org/package=NBBttest).
Collapse
|
42
|
Dai X, Shen L. Advances and Trends in Omics Technology Development. Front Med (Lausanne) 2022; 9:911861. [PMID: 35860739 PMCID: PMC9289742 DOI: 10.3389/fmed.2022.911861] [Citation(s) in RCA: 130] [Impact Index Per Article: 43.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Accepted: 05/09/2022] [Indexed: 12/11/2022] Open
Abstract
The human history has witnessed the rapid development of technologies such as high-throughput sequencing and mass spectrometry that led to the concept of “omics” and methodological advancement in systematically interrogating a cellular system. Yet, the ever-growing types of molecules and regulatory mechanisms being discovered have been persistently transforming our understandings on the cellular machinery. This renders cell omics seemingly, like the universe, expand with no limit and our goal toward the complete harness of the cellular system merely impossible. Therefore, it is imperative to review what has been done and is being done to predict what can be done toward the translation of omics information to disease control with minimal cell perturbation. With a focus on the “four big omics,” i.e., genomics, transcriptomics, proteomics, metabolomics, we delineate hierarchies of these omics together with their epiomics and interactomics, and review technologies developed for interrogation. We predict, among others, redoxomics as an emerging omics layer that views cell decision toward the physiological or pathological state as a fine-tuned redox balance.
Collapse
|
43
|
A survey of transcriptome complexity using full-length isoform sequencing in the tea plant Camellia sinensis. Mol Genet Genomics 2022; 297:1243-1255. [PMID: 35763065 DOI: 10.1007/s00438-022-01913-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 05/29/2022] [Indexed: 10/17/2022]
Abstract
Tea is one of the most popular beverages and its leaves are rich in catechins, contributing to the diverse flavor as well as beneficial for human health. However, the study of the post-transcriptional regulatory mechanism affecting the synthesis of catechins remains insufficient. Here, we sequenced the transcriptome using PacBio sequencing technology and obtained 63,111 full-length high-quality isoforms, including 1302 potential novel genes and 583 highly reliable fusion transcripts. We also identified 1204 lncRNAs with high quality, containing 188 known and 1016 novel lncRNAs. In addition, 311 mis-annotated genes were corrected based on the high-quality Isoseq reads. A large number of alternative splicing (AS) events (3784) and alternative polyadenylation (APA) genes (18,714) were analyzed, accounting for 8.84% and 43.7% of the total annotated genes, respectively. We also found that 2884 genes containing AS and APA features exhibited higher expression levels than other genes. These genes are mainly involved in amino acid biosynthesis, carbon fixation in photosynthetic organisms, phenylalanine, tyrosine, tryptophan biosynthesis, and pyruvate metabolism, suggesting that they play an essential role in the catechins content of tea polyphenols. Our results further improved the level of genome annotation and indicated that post-transcriptional regulation plays a crucial part in synthesizing catechins.
Collapse
|
44
|
Guvenek A, Shin J, De Filippis L, Zheng D, Wang W, Pang ZP, Tian B. Neuronal Cells Display Distinct Stability Controls of Alternative Polyadenylation mRNA Isoforms, Long Non-Coding RNAs, and Mitochondrial RNAs. Front Genet 2022; 13:840369. [PMID: 35664307 PMCID: PMC9159357 DOI: 10.3389/fgene.2022.840369] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 02/28/2022] [Indexed: 11/25/2022] Open
Abstract
RNA stability plays an important role in gene expression. Here, using 3' end sequencing of newly made and pre-existing poly(A)+ RNAs, we compare transcript stability in multiple human cell lines, including HEK293T, HepG2, and SH-SY5Y. We show that while mRNA stability is generally conserved across the cell lines, specific transcripts having a high GC content and possibly more stable secondary RNA structures are relatively more stable in SH-SY5Y cells compared to the other 2 cell lines. These features also differentiate stability levels of alternative polyadenylation (APA) 3'UTR isoforms in a cell type-specific manner. Using differentiation of a neural stem cell line as a model, we show that mRNA stability difference could contribute to gene expression changes in neurogenesis and confirm the neuronal identity of SH-SY5Y cells at both gene expression and APA levels. In addition, compared to transcripts using 3'-most exon cleavage/polyadenylation sites (PASs), those using intronic PASs are generally less stable, especially when the PAS-containing intron is large and has a strong 5' splice site, suggesting that intronic polyadenylation mostly plays a negative role in gene expression. Interestingly, the differential mRNA stability among APA isoforms appears to buffer PAS choice in these cell lines. Moreover, we found that several other poly(A)+ RNA species, including promoter-associated long noncoding RNAs and transcripts encoded by the mitochondrial genome, are more stable in SH-SY5Y cells than the other 2 cell lines, further highlighting distinct RNA metabolism in neuronal cells. Together, our results indicate that distinct RNA stability control in neuronal cells may contribute to the gene expression and APA programs that define their cell identity.
Collapse
Affiliation(s)
- Aysegul Guvenek
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
- Rutgers School of Graduate Studies, Newark, NJ, United States
| | - Jihae Shin
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
| | - Lidia De Filippis
- Department of Neuroscience and Cell Biology, Child Health Institute of New Jersey, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, United States
| | - Dinghai Zheng
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
| | - Wei Wang
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
| | - Zhiping P. Pang
- Department of Neuroscience and Cell Biology, Child Health Institute of New Jersey, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, United States
| | - Bin Tian
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
- Program in Gene Expression and Regulation, Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, United States
| |
Collapse
|
45
|
Solayman M, Litfin T, Singh J, Paliwal K, Zhou Y, Zhan J. Probing RNA structures and functions by solvent accessibility: an overview from experimental and computational perspectives. Brief Bioinform 2022; 23:bbac112. [PMID: 35348613 PMCID: PMC9116373 DOI: 10.1093/bib/bbac112] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2021] [Revised: 03/03/2022] [Accepted: 03/04/2022] [Indexed: 12/30/2022] Open
Abstract
Characterizing RNA structures and functions have mostly been focused on 2D, secondary and 3D, tertiary structures. Recent advances in experimental and computational techniques for probing or predicting RNA solvent accessibility make this 1D representation of tertiary structures an increasingly attractive feature to explore. Here, we provide a survey of these recent developments, which indicate the emergence of solvent accessibility as a simple 1D property, adding to secondary and tertiary structures for investigating complex structure-function relations of RNAs.
Collapse
Affiliation(s)
- Md Solayman
- Institute for Glycomics, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Thomas Litfin
- Institute for Glycomics, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
| | - Jaswinder Singh
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD 4111, Australia
| | - Kuldip Paliwal
- Signal Processing Laboratory, School of Engineering and Built Environment, Griffith University, Brisbane, QLD 4111, Australia
| | - Yaoqi Zhou
- Institute for Glycomics, Griffith University, Parklands Dr. Southport, QLD 4222, Australia
- Institute for Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518055, China
- Peking University Shenzhen Graduate School, Shenzhen 518055, China
| | - Jian Zhan
- Institute for Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen 518055, China
| |
Collapse
|
46
|
Bai Y, Qin Y, Fan Z, Morrison RM, Nam K, Zarour HM, Koldamova R, Padiath QS, Kim S, Park HJ. scMAPA: Identification of cell-type-specific alternative polyadenylation in complex tissues. Gigascience 2022; 11:giac033. [PMID: 35488860 PMCID: PMC9055853 DOI: 10.1093/gigascience/giac033] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 11/18/2021] [Accepted: 03/15/2022] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Alternative polyadenylation (APA) causes shortening or lengthening of the 3'-untranslated region (3'-UTR) of genes (APA genes) in diverse cellular processes such as cell proliferation and differentiation. To identify cell-type-specific APA genes in scRNA-Seq data, current bioinformatic methods have several limitations. First, they assume certain read coverage shapes in the scRNA-Seq data, which can be violated in multiple APA genes. Second, their identification is limited between 2 cell types and not directly applicable to the data of multiple cell types. Third, they do not control undesired source of variance, which potentially introduces noise to the cell-type-specific identification of APA genes. FINDINGS We developed a combination of a computational change-point algorithm and a statistical model, single-cell Multi-group identification of APA (scMAPA). To avoid the assumptions on the read coverage shape, scMAPA formulates a change-point problem after transforming the 3' biased scRNA-Seq data to represent the full-length 3'-UTR signal. To identify cell-type-specific APA genes while adjusting for undesired source of variation, scMAPA models APA isoforms in consideration of the cell types and the undesired source. In our novel simulation data and data from human peripheral blood mononuclear cells, scMAPA outperforms existing methods in sensitivity, robustness, and stability. In mouse brain data consisting of multiple cell types sampled from multiple regions, scMAPA identifies cell-type-specific APA genes, elucidating novel roles of APA for dividing immune cells and differentiated neuron cells and in multiple brain disorders. CONCLUSIONS scMAPA elucidates the cell-type-specific function of APA events and sheds novel insights into the functional roles of APA events in complex tissues.
Collapse
Affiliation(s)
- Yulong Bai
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Yidi Qin
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Zhenjiang Fan
- Department of Computer Science, School of Computing and Information, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Robert M Morrison
- Department of Medicine and Division of Hematology/Oncology, University of Pittsburgh, School of Medicine, Pittsburgh, PA 15213, USA
- Department of Immunology, University of Pittsburgh, School of Medicine, Pittsburgh, PA 15213, USA
- Department of Computational and Systems Biology, University of Pittsburgh Medical Center, Pittsburgh, PA 15213, USA
| | - KyongNyon Nam
- Department of Environmental and Occupational Health, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Hassane M Zarour
- Department of Medicine and Division of Hematology/Oncology, University of Pittsburgh, School of Medicine, Pittsburgh, PA 15213, USA
- Department of Immunology, University of Pittsburgh, School of Medicine, Pittsburgh, PA 15213, USA
| | - Radosveta Koldamova
- Department of Environmental and Occupational Health, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
| | - Quasar Saleem Padiath
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
- Department of Neurobiology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Soyeon Kim
- Department of Pediatrics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, 15224, USA
- Division of Pediatric Pulmonary Medicine, UPMC Children’s Hospital of Pittsburgh, Pittsburgh, PA 15224, USA
| | - Hyun Jung Park
- Department of Human Genetics, Graduate School of Public Health, University of Pittsburgh, Pittsburgh, PA 15261, USA
| |
Collapse
|
47
|
Alternative polyadenylation associated with prognosis and therapy in colorectal cancer. Sci Rep 2022; 12:7036. [PMID: 35487956 PMCID: PMC9054804 DOI: 10.1038/s41598-022-11089-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Accepted: 04/11/2022] [Indexed: 11/24/2022] Open
Abstract
Colorectal cancer (CRC) is among the most widely spread cancers globally. Aberrant alternative polyadenylation (APA) plays a role in cancer onset and its progression. Consequently, this study focused on highlighting the role of APA events and signals in the prognosis of patients with CRC. The APA events, RNA sequencing (RNA-seq), somatic mutations, copy number variants (CNVs), and clinical information of the CRC cohort were obtained from The Cancer Genome Atlas (TCGA) database and UCSC (University of California-Santa Cruz) Xena database. The whole set was sorted into two sets: a training set and a test set in a ratio of 7:3. 197 prognosis-related APA events were collected by performing univariate Cox regression signature in patients with CRC. Subsequently, a signature for APA events was established by least absolute shrinkage and selection operator (LASSO) and multivariate Cox analysis. The risk scores were measured for individual patients on the basis of the signature and patients were sorted into two groups; the high-risk group and the low-risk group as per their median risk scores. Kaplan–Meier curves, principal component analysis (PCA), and time-dependent receiver operator characteristic (ROC) curves revealed that the signature was able to predict patient prognosis effectively and further validation was provided in the test set and the whole set. The high-risk and low-risk groups displayed various distributions of mutations and CNVs. Tumor mutation burden (TMB) alone and in combination with the signature predicted the prognosis of CRC patients, but the gene frequencies of TMBs and CNVs did not change in the low- and high-risk groups. Moreover, immunotherapy and chemotherapy treatments showed different responses to PD-1 inhibitors and multiple chemotherapeutic agents in the low and high-risk groups based on the tumor immune dysfunction and exclusion (TIDE) and genomics of drugs sensitivity in cancer (GDSC) databases. This study may help in understanding the potential roles of APA in CRC, and the signature for prognosis-related APA events can work as a potential predictor for survival and treatment in patients with CRC.
Collapse
|
48
|
Leveraging omic features with F3UTER enables identification of unannotated 3'UTRs for synaptic genes. Nat Commun 2022; 13:2270. [PMID: 35477703 PMCID: PMC9046390 DOI: 10.1038/s41467-022-30017-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 03/18/2022] [Indexed: 11/08/2022] Open
Abstract
There is growing evidence for the importance of 3' untranslated region (3'UTR) dependent regulatory processes. However, our current human 3'UTR catalogue is incomplete. Here, we develop a machine learning-based framework, leveraging both genomic and tissue-specific transcriptomic features to predict previously unannotated 3'UTRs. We identify unannotated 3'UTRs associated with 1,563 genes across 39 human tissues, with the greatest abundance found in the brain. These unannotated 3'UTRs are significantly enriched for RNA binding protein (RBP) motifs and exhibit high human lineage-specificity. We find that brain-specific unannotated 3'UTRs are enriched for the binding motifs of important neuronal RBPs such as TARDBP and RBFOX1, and their associated genes are involved in synaptic function. Our data is shared through an online resource F3UTER ( https://astx.shinyapps.io/F3UTER/ ). Overall, our data improves 3'UTR annotation and provides additional insights into the mRNA-RBP interactome in the human brain, with implications for our understanding of neurological and neurodevelopmental diseases.
Collapse
|
49
|
Ghosh S, Ataman M, Bak M, Börsch A, Schmidt A, Buczak K, Martin G, Dimitriades B, Herrmann CJ, Kanitz A, Zavolan M. CFIm-mediated alternative polyadenylation remodels cellular signaling and miRNA biogenesis. Nucleic Acids Res 2022; 50:3096-3114. [PMID: 35234914 PMCID: PMC8989530 DOI: 10.1093/nar/gkac114] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 01/31/2022] [Accepted: 02/04/2022] [Indexed: 12/13/2022] Open
Abstract
The mammalian cleavage factor I (CFIm) has been implicated in alternative polyadenylation (APA) in a broad range of contexts, from cancers to learning deficits and parasite infections. To determine how the CFIm expression levels are translated into these diverse phenotypes, we carried out a multi-omics analysis of cell lines in which the CFIm25 (NUDT21) or CFIm68 (CPSF6) subunits were either repressed by siRNA-mediated knockdown or over-expressed from stably integrated constructs. We established that >800 genes undergo coherent APA in response to changes in CFIm levels, and they cluster in distinct functional classes related to protein metabolism. The activity of the ERK pathway traces the CFIm concentration, and explains some of the fluctuations in cell growth and metabolism that are observed upon CFIm perturbations. Furthermore, multiple transcripts encoding proteins from the miRNA pathway are targets of CFIm-dependent APA. This leads to an increased biogenesis and repressive activity of miRNAs at the same time as some 3′ UTRs become shorter and presumably less sensitive to miRNA-mediated repression. Our study provides a first systematic assessment of a core set of APA targets that respond coherently to changes in CFIm protein subunit levels (CFIm25/CFIm68). We describe the elicited signaling pathways downstream of CFIm, which improve our understanding of the key role of CFIm in integrating RNA processing with other cellular activities.
Collapse
Affiliation(s)
- Souvik Ghosh
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Meric Ataman
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland.,Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Maciej Bak
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland.,Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Anastasiya Börsch
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland.,Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Alexander Schmidt
- Proteomics Core Facility, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Katarzyna Buczak
- Proteomics Core Facility, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Georges Martin
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Beatrice Dimitriades
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Christina J Herrmann
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland.,Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Alexander Kanitz
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland.,Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| | - Mihaela Zavolan
- Computational and Systems Biology, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland.,Swiss Institute of Bioinformatics, Biozentrum, University of Basel, Spitalstrasse 41, 4056 Basel, Switzerland
| |
Collapse
|
50
|
Wei L, Lai EC. Regulation of the Alternative Neural Transcriptome by ELAV/Hu RNA Binding Proteins. Front Genet 2022; 13:848626. [PMID: 35281806 PMCID: PMC8904962 DOI: 10.3389/fgene.2022.848626] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 02/01/2022] [Indexed: 11/30/2022] Open
Abstract
The process of alternative polyadenylation (APA) generates multiple 3' UTR isoforms for a given locus, which can alter regulatory capacity and on occasion change coding potential. APA was initially characterized for a few genes, but in the past decade, has been found to be the rule for metazoan genes. While numerous differences in APA profiles have been catalogued across genetic conditions, perturbations, and diseases, our knowledge of APA mechanisms and biology is far from complete. In this review, we highlight recent findings regarding the role of the conserved ELAV/Hu family of RNA binding proteins (RBPs) in generating the broad landscape of lengthened 3' UTRs that is characteristic of neurons. We relate this to their established roles in alternative splicing, and summarize ongoing directions that will further elucidate the molecular strategies for neural APA, the in vivo functions of ELAV/Hu RBPs, and the phenotypic consequences of these regulatory paradigms in neurons.
Collapse
Affiliation(s)
- Lu Wei
- Key Laboratory of RNA Biology, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Eric C. Lai
- Developmental Biology Program, Sloan Kettering Institute, New York, NY, United States
| |
Collapse
|