1
|
Ni Z, Ahmed N, Nabeel-Shah S, Guo X, Pu S, Song J, Marcon E, Burke G, Tong AH, Chan K, Ha KH, Blencowe B, Moffat J, Greenblatt J. Identifying human pre-mRNA cleavage and polyadenylation factors by genome-wide CRISPR screens using a dual fluorescence readthrough reporter. Nucleic Acids Res 2024; 52:4483-4501. [PMID: 38587191 PMCID: PMC11077057 DOI: 10.1093/nar/gkae240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Revised: 01/29/2024] [Accepted: 04/02/2024] [Indexed: 04/09/2024] Open
Abstract
Messenger RNA precursors (pre-mRNA) generally undergo 3' end processing by cleavage and polyadenylation (CPA), which is specified by a polyadenylation site (PAS) and adjacent RNA sequences and regulated by a large variety of core and auxiliary CPA factors. To date, most of the human CPA factors have been discovered through biochemical and proteomic studies. However, genetic identification of the human CPA factors has been hampered by the lack of a reliable genome-wide screening method. We describe here a dual fluorescence readthrough reporter system with a PAS inserted between two fluorescent reporters. This system enables measurement of the efficiency of 3' end processing in living cells. Using this system in combination with a human genome-wide CRISPR/Cas9 library, we conducted a screen for CPA factors. The screens identified most components of the known core CPA complexes and other known CPA factors. The screens also identified CCNK/CDK12 as a potential core CPA factor, and RPRD1B as a CPA factor that binds RNA and regulates the release of RNA polymerase II at the 3' ends of genes. Thus, this dual fluorescence reporter coupled with CRISPR/Cas9 screens reliably identifies bona fide CPA factors and provides a platform for investigating the requirements for CPA in various contexts.
Collapse
Affiliation(s)
- Zuyao Ni
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Nujhat Ahmed
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Syed Nabeel-Shah
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Xinghua Guo
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Shuye Pu
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Jingwen Song
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Edyta Marcon
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Giovanni L Burke
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Amy Hin Yan Tong
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON Canada
| | - Katherine Chan
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON Canada
| | - Kevin C H Ha
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Benjamin J Blencowe
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Jason Moffat
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON Canada
- Institute for Biomaterials and Biomedical Engineering, University of Toronto, Toronto, ON Canada
| | - Jack F Greenblatt
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| |
Collapse
|
2
|
Shademan M, Mei H, van Engelen B, Ariyurek Y, Kloet S, Raz V. PABPN1 loss-of-function causes APA-shift in oculopharyngeal muscular dystrophy. HGG ADVANCES 2024; 5:100269. [PMID: 38213032 PMCID: PMC10840355 DOI: 10.1016/j.xhgg.2024.100269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 01/05/2024] [Accepted: 01/05/2024] [Indexed: 01/13/2024] Open
Abstract
Alternative polyadenylation (APA) at the 3' UTR of transcripts contributes to the cell transcriptome. APA is suppressed by the nuclear RNA-binding protein PABPN1. Aging-associated reduced PABPN1 levels in skeletal muscles lead to muscle wasting. Muscle weakness in oculopharyngeal muscular dystrophy (OPMD) is caused by short alanine expansion in PABPN1 exon1. The expanded PABPN1 forms nuclear aggregates, an OPMD hallmark. Whether the expanded PABPN1 affects APA and how it contributes to muscle pathology is unresolved. To investigate these questions, we developed a procedure including RNA library preparation and a simple pipeline calculating the APA-shift ratio as a readout for PABPN1 activity. Comparing APA-shift results to previously published PAS utilization and APA-shift results, we validated this procedure. The procedure was then applied on the OPMD cell model and on RNA from OPMD muscles. APA-shift was genome-wide in the mouse OPMD model, primarily affecting muscle transcripts. In OPMD individuals, APA-shift was enriched with muscle transcripts. In an OPMD cell model APA-shift was not significant. APA-shift correlated with reduced expression levels of a subset of PABPN1 isoforms, whereas the expression of the expanded PABPN1 did not correlate with APA-shift. PABPN1 activity is not affected by the expression of expanded PABPN1, but rather by reduced PABPN1 expression levels. In muscles, PABPN1 activity initially affects muscle transcripts. We suggest that muscle weakness in OPMD is caused by PABPN1 loss-of-function leading to APA-shift that primarily affects in muscle transcripts.
Collapse
Affiliation(s)
- Milad Shademan
- Department of Human Genetics, Leiden University Medical Centre, Leiden, the Netherlands
| | - Hailiang Mei
- Department of Biomedical Data Sciences, Leiden University Medical Centre, Leiden, the Netherlands
| | - Baziel van Engelen
- Department of Neurology, Radboud University Medical Centre, Nijmegen, the Netherlands
| | - Yavuz Ariyurek
- Department of Human Genetics, Leiden University Medical Centre, Leiden, the Netherlands
| | - Susan Kloet
- Department of Human Genetics, Leiden University Medical Centre, Leiden, the Netherlands
| | - Vered Raz
- Department of Human Genetics, Leiden University Medical Centre, Leiden, the Netherlands.
| |
Collapse
|
3
|
Srinivas T, Siqueira E, Guil S. Techniques for investigating lncRNA transcript functions in neurodevelopment. Mol Psychiatry 2024; 29:874-890. [PMID: 38145986 PMCID: PMC11176085 DOI: 10.1038/s41380-023-02377-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 12/05/2023] [Accepted: 12/12/2023] [Indexed: 12/27/2023]
Abstract
Long noncoding RNAs (lncRNAs) are sequences of 200 nucleotides or more that are transcribed from a large portion of the mammalian genome. While hypothesized to have a variety of biological roles, many lncRNAs remain largely functionally uncharacterized due to unique challenges associated with their investigation. For example, some lncRNAs overlap with other genomic loci, are expressed in a cell-type-specific manner, and/or are differentially processed at the post-transcriptional level. The mammalian CNS contains a vast diversity of lncRNAs, and lncRNAs are highly abundant in the mammalian brain. However, interrogating lncRNA function in models of the CNS, particularly in vivo, can be complex and challenging. Here we review the breadth of methods used to investigate lncRNAs in the CNS, their merits, and the understanding they can provide with respect to neurodevelopment and pathophysiology. We discuss remaining challenges in the field and provide recommendations to assay lncRNAs based on current methods.
Collapse
Affiliation(s)
- Tara Srinivas
- Josep Carreras Leukaemia Research Institute (IJC), 08916, Badalona, Barcelona, Catalonia, Spain
| | - Edilene Siqueira
- Josep Carreras Leukaemia Research Institute (IJC), 08916, Badalona, Barcelona, Catalonia, Spain
| | - Sonia Guil
- Josep Carreras Leukaemia Research Institute (IJC), 08916, Badalona, Barcelona, Catalonia, Spain.
- Germans Trias i Pujol Health Science Research Institute, 08916, Badalona, Barcelona, Catalonia, Spain.
| |
Collapse
|
4
|
Liu X, Chen H, Li Z, Yang X, Jin W, Wang Y, Zheng J, Li L, Xuan C, Yuan J, Yang Y. InPACT: a computational method for accurate characterization of intronic polyadenylation from RNA sequencing data. Nat Commun 2024; 15:2583. [PMID: 38519498 PMCID: PMC10960005 DOI: 10.1038/s41467-024-46875-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 03/12/2024] [Indexed: 03/25/2024] Open
Abstract
Alternative polyadenylation can occur in introns, termed intronic polyadenylation (IPA), has been implicated in diverse biological processes and diseases, as it can produce noncoding transcripts or transcripts with truncated coding regions. However, a reliable method is required to accurately characterize IPA. Here, we propose a computational method called InPACT, which allows for the precise characterization of IPA from conventional RNA-seq data. InPACT successfully identifies numerous previously unannotated IPA transcripts in human cells, many of which are translated, as evidenced by ribosome profiling data. We have demonstrated that InPACT outperforms other methods in terms of IPA identification and quantification. Moreover, InPACT applied to monocyte activation reveals temporally coordinated IPA events. Further application on single-cell RNA-seq data of human fetal bone marrow reveals the expression of several IPA isoforms in a context-specific manner. Therefore, InPACT represents a powerful tool for the accurate characterization of IPA from RNA-seq data.
Collapse
Affiliation(s)
- Xiaochuan Liu
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Hao Chen
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Zekun Li
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Xiaoxiao Yang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Wen Jin
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Yuting Wang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Jian Zheng
- Department of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Long Li
- Department of Immunology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China
| | - Chenghao Xuan
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China.
| | - Jiapei Yuan
- State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Institute of Hematology and Blood Diseases Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin, 300020, China.
- Tianjin Institutes of Health Science, Tianjin, 301600, China.
| | - Yang Yang
- The Province and Ministry Co-sponsored Collaborative Innovation Center for Medical Epigenetics, Tianjin Key Laboratory of Inflammatory Biology, The Second Hospital of Tianjin Medical University, Department of Bioinformatics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China.
- Department of Pharmacology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, 300070, China.
| |
Collapse
|
5
|
Seyres D, Gorka O, Schmidt R, Marone R, Zavolan M, Jeker LT. T helper cells exhibit a dynamic and reversible 3'-UTR landscape. RNA (NEW YORK, N.Y.) 2024; 30:418-434. [PMID: 38302256 PMCID: PMC10946431 DOI: 10.1261/rna.079897.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 01/16/2024] [Indexed: 02/03/2024]
Abstract
3' untranslated regions (3' UTRs) are critical elements of messenger RNAs, as they contain binding sites for RNA-binding proteins (RBPs) and microRNAs that affect various aspects of the RNA life cycle including transcript stability and cellular localization. In response to T cell receptor activation, T cells undergo massive expansion during the effector phase of the immune response and dynamically modify their 3' UTRs. Whether this serves to directly regulate the abundance of specific mRNAs or is a secondary effect of proliferation remains unclear. To study 3'-UTR dynamics in T helper cells, we investigated division-dependent alternative polyadenylation (APA). In addition, we generated 3' end UTR sequencing data from naive, activated, memory, and regulatory CD4+ T cells. 3'-UTR length changes were estimated using a nonnegative matrix factorization approach and were compared with those inferred from long-read PacBio sequencing. We found that APA events were transient and reverted after effector phase expansion. Using an orthogonal bulk RNA-seq data set, we did not find evidence of APA association with differential gene expression or transcript usage, indicating that APA has only a marginal effect on transcript abundance. 3'-UTR sequence analysis revealed conserved binding sites for T cell-relevant microRNAs and RBPs in the alternative 3' UTRs. These results indicate that poly(A) site usage could play an important role in the control of cell fate decisions and homeostasis.
Collapse
Affiliation(s)
- Denis Seyres
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| | - Oliver Gorka
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| | - Ralf Schmidt
- Computational and Systems Biology, Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Romina Marone
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| | - Mihaela Zavolan
- Computational and Systems Biology, Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, Biozentrum, University of Basel, 4056 Basel, Switzerland
| | - Lukas T Jeker
- Department of Biomedicine, Basel University Hospital and University of Basel, CH-4031 Basel, Switzerland
- Transplantation Immunology and Nephrology, Basel University Hospital, CH-4031 Basel, Switzerland
| |
Collapse
|
6
|
Li Y, Gong J, Sun Q, Vong EG, Cheng X, Wang B, Yuan Y, Jin L, Gamazon ER, Zhou D, Lai M, Zhang D. Alternative polyadenylation quantitative trait methylation mapping in human cancers provides clues into the molecular mechanisms of APA. Am J Hum Genet 2024; 111:562-583. [PMID: 38367620 PMCID: PMC10940021 DOI: 10.1016/j.ajhg.2024.01.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 01/19/2024] [Accepted: 01/22/2024] [Indexed: 02/19/2024] Open
Abstract
Genetic variants are involved in the orchestration of alternative polyadenylation (APA) events, while the role of DNA methylation in regulating APA remains unclear. We generated a comprehensive atlas of APA quantitative trait methylation sites (apaQTMs) across 21 different types of cancer (1,612 to 60,219 acting in cis and 4,448 to 142,349 in trans). Potential causal apaQTMs in non-cancer samples were also identified. Mechanistically, we observed a strong enrichment of cis-apaQTMs near polyadenylation sites (PASs) and both cis- and trans-apaQTMs in proximity to transcription factor (TF) binding regions. Through the integration of ChIP-signals and RNA-seq data from cell lines, we have identified several regulators of APA events, acting either directly or indirectly, implicating novel functions of some important genes, such as TCF7L2, which is known for its involvement in type 2 diabetes and cancers. Furthermore, we have identified a vast number of QTMs that share the same putative causal CpG sites with five different cancer types, underscoring the roles of QTMs, including apaQTMs, in the process of tumorigenesis. DNA methylation is extensively involved in the regulation of APA events in human cancers. In an attempt to elucidate the potential underlying molecular mechanisms of APA by DNA methylation, our study paves the way for subsequent experimental validations into the intricate biological functions of DNA methylation in APA regulation and the pathogenesis of human cancers. To present a comprehensive catalog of apaQTM patterns, we introduce the Pancan-apaQTM database, available at https://pancan-apaqtm-zju.shinyapps.io/pancanaQTM/.
Collapse
Affiliation(s)
- Yige Li
- Department of Pathology, and Department of Medical Oncology of the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China; Department of Pathology, Key Laboratory of Disease Proteomics of Zhejiang Province, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China; Department of Pathology, the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Jingwen Gong
- Department of Pathology, and Department of Medical Oncology of the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China; Department of Pathology, Key Laboratory of Disease Proteomics of Zhejiang Province, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China; Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, Zhejiang Province, China
| | - Qingrong Sun
- Department of Pathology, Key Laboratory of Disease Proteomics of Zhejiang Province, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China; Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, Zhejiang Province, China; Liangzhu Laboratory, Zhejiang University Medical Center, 1369 West Wenyi Road, Hangzhou, Zhejiang Province, China; College of Information Science and Technology, ZheJiang Shuren University, Hangzhou 310015, ZheJiang, China
| | - Eu Gene Vong
- Department of Pathology, Key Laboratory of Disease Proteomics of Zhejiang Province, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China; Department of Biochemistry and Genetics, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China; The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Xiaoqing Cheng
- Department of Pathology, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China
| | - Binghong Wang
- Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China
| | - Ying Yuan
- Department of Medical Oncology, the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China; Cancer Institute (Key Laboratory of Cancer Prevention and Intervention, Chinese National Ministry of Education), the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China
| | - Li Jin
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, Fudan University, Shanghai, China; Research Unit of Dissecting the Population Genetics and Developing New Technologies for Treatment and Prevention of Skin Phenotypes and Dermatological Diseases (2019RU058), Chinese Academy of Medical Sciences, Shanghai, China
| | - Eric R Gamazon
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University Medical Center, Nashville, TN, USA; Data Science Institute, Vanderbilt University Medical Center, Nashville, TN, USA; Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Dan Zhou
- Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA; School of Public Health and the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Maode Lai
- Department of Pathology, Key Laboratory of Disease Proteomics of Zhejiang Province, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China; Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, Zhejiang Province, China; Department of Pathology, Research Unit of Intelligence Classification of Tumor Pathology and Precision Therapy, Chinese Academy of Medical Sciences (2019RU042), Key Laboratory of Disease Proteomics of Zhejiang Province, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China.
| | - Dandan Zhang
- Department of Pathology, and Department of Medical Oncology of the Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China; Department of Pathology, Key Laboratory of Disease Proteomics of Zhejiang Province, School of Medicine, Zhejiang University, Hangzhou, Zhejiang Province, China; Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, Zhejiang Province, China; Department of Pathology, Research Unit of Intelligence Classification of Tumor Pathology and Precision Therapy, Chinese Academy of Medical Sciences (2019RU042), Key Laboratory of Disease Proteomics of Zhejiang Province, Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China.
| |
Collapse
|
7
|
Jonnakuti VS, Wagner EJ, Maletić-Savatić M, Liu Z, Yalamanchili HK. PolyAMiner-Bulk is a deep learning-based algorithm that decodes alternative polyadenylation dynamics from bulk RNA-seq data. CELL REPORTS METHODS 2024; 4:100707. [PMID: 38325383 PMCID: PMC10921021 DOI: 10.1016/j.crmeth.2024.100707] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/13/2023] [Accepted: 01/11/2024] [Indexed: 02/09/2024]
Abstract
Alternative polyadenylation (APA) is a key post-transcriptional regulatory mechanism; yet, its regulation and impact on human diseases remain understudied. Existing bulk RNA sequencing (RNA-seq)-based APA methods predominantly rely on predefined annotations, severely impacting their ability to decode novel tissue- and disease-specific APA changes. Furthermore, they only account for the most proximal and distal cleavage and polyadenylation sites (C/PASs). Deconvoluting overlapping C/PASs and the inherent noisy 3' UTR coverage in bulk RNA-seq data pose additional challenges. To overcome these limitations, we introduce PolyAMiner-Bulk, an attention-based deep learning algorithm that accurately recapitulates C/PAS sequence grammar, resolves overlapping C/PASs, captures non-proximal-to-distal APA changes, and generates visualizations to illustrate APA dynamics. Evaluation on multiple datasets strongly evinces the performance merit of PolyAMiner-Bulk, accurately identifying more APA changes compared with other methods. With the growing importance of APA and the abundance of bulk RNA-seq data, PolyAMiner-Bulk establishes a robust paradigm of APA analysis.
Collapse
Affiliation(s)
- Venkata Soumith Jonnakuti
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Program in Quantitative and Computational Biology, Baylor College of Medicine, Houston, TX 77030, USA; Medical Scientist Training Program, Baylor College of Medicine, Houston, TX 77030, USA
| | - Eric J Wagner
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, USA
| | - Mirjana Maletić-Savatić
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Zhandong Liu
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Program in Quantitative and Computational Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Hari Krishna Yalamanchili
- Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; USDA/ARS Children's Nutrition Research Center, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA.
| |
Collapse
|
8
|
Torres-Ulloa L, Calvo-Roitberg E, Pai AA. Genome-wide kinetic profiling of pre-mRNA 3' end cleavage. RNA (NEW YORK, N.Y.) 2024; 30:256-270. [PMID: 38164598 PMCID: PMC10870368 DOI: 10.1261/rna.079783.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Accepted: 12/13/2023] [Indexed: 01/03/2024]
Abstract
Cleavage and polyadenylation is necessary for the formation of mature mRNA molecules. The rate at which this process occurs can determine the temporal availability of mRNA for subsequent function throughout the cell and is likely tightly regulated. Despite advances in high-throughput approaches for global kinetic profiling of RNA maturation, genome-wide 3' end cleavage rates have never been measured. Here, we describe a novel approach to estimate the rates of cleavage, using metabolic labeling of nascent RNA, high-throughput sequencing, and mathematical modeling. Using in silico simulations of nascent RNA-seq data, we show that our approach can accurately and precisely estimate cleavage half-lives for both constitutive and alternative sites. We find that 3' end cleavage is fast on average, with half-lives under a minute, but highly variable across individual sites. Rapid cleavage is promoted by the presence of canonical sequence elements and an increased density of polyadenylation signals near a cleavage site. Finally, we find that cleavage rates are associated with the localization of RNA polymerase II at the end of a gene, and faster cleavage leads to quicker degradation of downstream readthrough RNA. Our findings shed light on the features important for efficient 3' end cleavage and the regulation of transcription termination.
Collapse
Affiliation(s)
- Leslie Torres-Ulloa
- RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, USA
| | - Ezequiel Calvo-Roitberg
- RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, USA
| | - Athma A Pai
- RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, Massachusetts 01605, USA
| |
Collapse
|
9
|
Zeng Y, Lovchykova A, Akiyama T, Liu C, Guo C, Jawahar VM, Sianto O, Calliari A, Prudencio M, Dickson DW, Petrucelli L, Gitler AD. TDP-43 nuclear loss in FTD/ALS causes widespread alternative polyadenylation changes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.22.575730. [PMID: 38328059 PMCID: PMC10849503 DOI: 10.1101/2024.01.22.575730] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
In frontotemporal dementia and amyotrophic lateral sclerosis, the RNA-binding protein TDP-43 is depleted from the nucleus. TDP-43 loss leads to cryptic exon inclusion but a role in other RNA processing events remains unresolved. Here, we show that loss of TDP-43 causes widespread changes in alternative polyadenylation, impacting expression of disease-relevant genes (e.g., ELP1, NEFL, and TMEM106B) and providing evidence that alternative polyadenylation is a new facet of TDP-43 pathology.
Collapse
Affiliation(s)
- Yi Zeng
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Tetsuya Akiyama
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Chang Liu
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Caiwei Guo
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Vidhya Maheswari Jawahar
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Odilia Sianto
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
| | - Anna Calliari
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Mercedes Prudencio
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Dennis W. Dickson
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Leonard Petrucelli
- Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
- Neuroscience Graduate Program, Mayo Clinic Graduate School of Biomedical Sciences, Jacksonville, FL, USA
| | - Aaron D. Gitler
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA
- Chan Zuckerberg Biohub – San Francisco, San Francisco, CA, USA
| |
Collapse
|
10
|
Bryce-Smith S, Burri D, Gazzara MR, Herrmann CJ, Danecka W, Fitzsimmons CM, Wan YK, Zhuang F, Fansler MM, Fernández JM, Ferret M, Gonzalez-Uriarte A, Haynes S, Herdman C, Kanitz A, Katsantoni M, Marini F, McDonnel E, Nicolet B, Poon CL, Rot G, Schärfen L, Wu PJ, Yoon Y, Barash Y, Zavolan M. Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data. RNA (NEW YORK, N.Y.) 2023; 29:1839-1855. [PMID: 37816550 PMCID: PMC10653393 DOI: 10.1261/rna.079849.123] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 09/21/2023] [Indexed: 10/12/2023]
Abstract
The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, limitations, and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3'-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for continuous extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies, while the containers and reproducible workflows could easily be deployed and extended to evaluate new methods or data sets.
Collapse
Affiliation(s)
- Sam Bryce-Smith
- Department of Neuromuscular Diseases, UCL Queen Square Motor Neuron Disease Centre, UCL Queen Square Institute of Neurology, UCL, London WC1N 3BG, United Kingdom
| | - Dominik Burri
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Matthew R Gazzara
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Christina J Herrmann
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Weronika Danecka
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3FF, United Kingdom
| | - Christina M Fitzsimmons
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Yuk Kei Wan
- Genome Institute of Singapore, Buona Vista, Singapore 138672
- Yong Loo Lin School of Medicine, National University of Singapore, Kent Ridge, Singapore 119228
| | - Farica Zhuang
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Mervin M Fansler
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell Graduate Studies, New York, New York 10065, USA
- Cancer Biology and Genetics, Sloan-Kettering Institute, MSKCC, New York, New York 10065, USA
| | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), 28029 Madrid, Spain
| | - Meritxell Ferret
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), 28029 Madrid, Spain
| | - Asier Gonzalez-Uriarte
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), 28029 Madrid, Spain
| | - Samuel Haynes
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh EH9 3FF, United Kingdom
| | - Chelsea Herdman
- Department of Neurobiology, University of Utah, Salt Lake City, Utah 84132, USA
| | - Alexander Kanitz
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Maria Katsantoni
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center of the Johannes Gutenberg-University Mainz, 55118 Mainz, Germany
| | - Euan McDonnel
- Leeds Institute for Data Analytics, School of Molecular and Cellular Biology, University of Leeds, Leeds LS2 9NL, United Kingdom
| | - Ben Nicolet
- Department of Hematopoiesis, Sanquin Research, Landsteiner Laboratory, Amsterdam UMC, University of Amsterdam, 1066 CX Amsterdam, The Netherlands
- Oncode Institute, 3521 AL Utrecht, The Netherlands
| | - Chi-Lam Poon
- Graduate School of Medical Sciences, Weill Cornell Medicine, New York, New York 10065, USA
| | - Gregor Rot
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- Institute of Molecular Life Sciences, University of Zurich, 8057 Zurich, Switzerland
| | - Leonard Schärfen
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven, Connecticut 06520, USA
| | - Pin-Jou Wu
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, 72076 Tübingen, Germany
| | - Yoseop Yoon
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California Irvine, Irvine, California 92617, USA
| | - Yoseph Barash
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Mihaela Zavolan
- Biozentrum, University of Basel, 4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| |
Collapse
|
11
|
Carrion SA, Michal JJ, Jiang Z. Alternative Transcripts Diversify Genome Function for Phenome Relevance to Health and Diseases. Genes (Basel) 2023; 14:2051. [PMID: 38002994 PMCID: PMC10671453 DOI: 10.3390/genes14112051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/06/2023] [Accepted: 11/07/2023] [Indexed: 11/26/2023] Open
Abstract
Manipulation using alternative exon splicing (AES), alternative transcription start (ATS), and alternative polyadenylation (APA) sites are key to transcript diversity underlying health and disease. All three are pervasive in organisms, present in at least 50% of human protein-coding genes. In fact, ATS and APA site use has the highest impact on protein identity, with their ability to alter which first and last exons are utilized as well as impacting stability and translation efficiency. These RNA variants have been shown to be highly specific, both in tissue type and stage, with demonstrated importance to cell proliferation, differentiation and the transition from fetal to adult cells. While alternative exon splicing has a limited effect on protein identity, its ubiquity highlights the importance of these minor alterations, which can alter other features such as localization. The three processes are also highly interwoven, with overlapping, complementary, and competing factors, RNA polymerase II and its CTD (C-terminal domain) chief among them. Their role in development means dysregulation leads to a wide variety of disorders and cancers, with some forms of disease disproportionately affected by specific mechanisms (AES, ATS, or APA). Challenges associated with the genome-wide profiling of RNA variants and their potential solutions are also discussed in this review.
Collapse
Affiliation(s)
| | | | - Zhihua Jiang
- Department of Animal Sciences and Center for Reproductive Biology, Washington State University, Pullman, WA 99164-7620, USA; (S.A.C.); (J.J.M.)
| |
Collapse
|
12
|
Dahal S, Clayton K, Cabral T, Cheng R, Jahanshahi S, Ahmed C, Koirala A, Villasmil Ocando A, Malty R, Been T, Hernandez J, Mangos M, Shen D, Babu M, Calarco J, Chabot B, Attisano L, Houry WA, Cochrane A. On a path toward a broad-spectrum anti-viral: inhibition of HIV-1 and coronavirus replication by SR kinase inhibitor harmine. J Virol 2023; 97:e0039623. [PMID: 37706687 PMCID: PMC10617549 DOI: 10.1128/jvi.00396-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 07/14/2023] [Indexed: 09/15/2023] Open
Abstract
IMPORTANCE This study highlights the crucial role RNA processing plays in regulating viral gene expression and replication. By targeting SR kinases, we identified harmine as a potent inhibitor of HIV-1 as well as coronavirus (HCoV-229E and multiple SARS-CoV-2 variants) replication. Harmine inhibits HIV-1 protein expression and reduces accumulation of HIV-1 RNAs in both cell lines and primary CD4+ T cells. Harmine also suppresses coronavirus replication post-viral entry by preferentially reducing coronavirus sub-genomic RNA accumulation. By focusing on host factors rather than viral targets, our study offers a novel approach to combating viral infections that is effective against a range of unrelated viruses. Moreover, at doses required to inhibit virus replication, harmine had limited toxicity and minimal effect on the host transcriptome. These findings support the viability of targeting host cellular processes as a means of developing broad-spectrum anti-virals.
Collapse
Affiliation(s)
- Subha Dahal
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Kiera Clayton
- Department of Pathology, University of Massachusetts Medical School, Worcester, Massachusetts, USA
| | - Tyler Cabral
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Ran Cheng
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Shahrzad Jahanshahi
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
| | - Choudhary Ahmed
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Amrit Koirala
- Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, Texas, USA
- Dan L. Duncan Cancer Comprehensive Center, Baylor College of Medicine, Houston, Texas, USA
| | | | - Ramy Malty
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
- Research and Innovation Centre, Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - Terek Been
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Javier Hernandez
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
| | - Maria Mangos
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
| | - David Shen
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Mohan Babu
- Research and Innovation Centre, Department of Biochemistry, University of Regina, Regina, Saskatchewan, Canada
| | - John Calarco
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Benoit Chabot
- Department of Microbiology and Infectious Diseases, Université de Sherbrooke, Sherbrooke, Quebec, Canada
| | - Liliana Attisano
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
| | - Walid A. Houry
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Chemistry, University of Toronto, Toronto, Ontario, Canada
| | - Alan Cochrane
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
13
|
Soulette CM, Hrabeta-Robinson E, Arevalo C, Felton C, Tang AD, Marin MG, Brooks AN. Full-length transcript alterations in human bronchial epithelial cells with U2AF1 S34F mutations. Life Sci Alliance 2023; 6:e202000641. [PMID: 37487637 PMCID: PMC10366530 DOI: 10.26508/lsa.202000641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 06/30/2023] [Accepted: 07/03/2023] [Indexed: 07/26/2023] Open
Abstract
U2AF1 is one of the most recurrently mutated splicing factors in lung adenocarcinoma and has been shown to cause transcriptome-wide pre-mRNA splicing alterations; however, the full-length altered mRNA isoforms associated with the mutation are largely unknown. To better understand the impact U2AF1 has on full-length isoform fate and function, we conducted high-throughput long-read cDNA sequencing from isogenic human bronchial epithelial cells with and without a U2AF1 S34F mutation. We identified 49,366 multi-exon transcript isoforms, more than half of which did not match GENCODE or short-read-assembled isoforms. We found 198 transcript isoforms with significant expression and usage changes relative to WT, only 68% of which were assembled by short reads. Expression of isoforms from immune-related genes is largely down-regulated in mutant cells and without observed splicing changes. Finally, we reveal that isoforms likely targeted by nonsense-mediated decay are down-regulated in U2AF1 S34F cells, suggesting that isoform changes may alter the translational output of those affected genes. Altogether, our work provides a resource of full-length isoforms associated with U2AF1 S34F in lung cells.
Collapse
Affiliation(s)
- Cameron M Soulette
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Cruz, CA, USA
| | - Eva Hrabeta-Robinson
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Carlos Arevalo
- Department of Molecular, Cellular and Developmental Biology, University of California, Santa Cruz, CA, USA
| | - Colette Felton
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Alison D Tang
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Maximillian G Marin
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Angela N Brooks
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| |
Collapse
|
14
|
Zhang Z, Bae B, Cuddleston WH, Miura P. Coordination of alternative splicing and alternative polyadenylation revealed by targeted long read sequencing. Nat Commun 2023; 14:5506. [PMID: 37679364 PMCID: PMC10484994 DOI: 10.1038/s41467-023-41207-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 08/25/2023] [Indexed: 09/09/2023] Open
Abstract
Nervous system development is associated with extensive regulation of alternative splicing (AS) and alternative polyadenylation (APA). AS and APA have been extensively studied in isolation, but little is known about how these processes are coordinated. Here, the coordination of cassette exon (CE) splicing and APA in Drosophila was investigated using a targeted long-read sequencing approach we call Pull-a-Long-Seq (PL-Seq). This cost-effective method uses cDNA pulldown and Nanopore sequencing combined with an analysis pipeline to quantify inclusion of alternative exons in connection with alternative 3' ends. Using PL-Seq, we identified genes that exhibit significant differences in CE splicing depending on connectivity to short versus long 3'UTRs. Genomic long 3'UTR deletion was found to alter upstream CE splicing in short 3'UTR isoforms and ELAV loss differentially affected CE splicing depending on connectivity to alternative 3'UTRs. This work highlights the importance of considering connectivity to alternative 3'UTRs when monitoring AS events.
Collapse
Affiliation(s)
- Zhiping Zhang
- Department of Genetics and Genome Sciences, University of Connecticut School of Medicine, Farmington, CT, USA
- Department of Biology, University of Nevada, Reno, Reno, NV, USA
| | - Bongmin Bae
- Department of Biology, University of Nevada, Reno, Reno, NV, USA
| | | | - Pedro Miura
- Department of Genetics and Genome Sciences, University of Connecticut School of Medicine, Farmington, CT, USA.
- Department of Biology, University of Nevada, Reno, Reno, NV, USA.
- Institute for System Genomics, University of Connecticut, Storrs, CT, USA.
| |
Collapse
|
15
|
Aygün N, Krupa O, Mory J, Le B, Valone J, Liang D, Love MI, Stein JL. Genetics of cell-type-specific post-transcriptional gene regulation during human neurogenesis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.30.555019. [PMID: 37693528 PMCID: PMC10491258 DOI: 10.1101/2023.08.30.555019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
The function of some genetic variants associated with brain-relevant traits has been explained through colocalization with expression quantitative trait loci (eQTL) conducted in bulk post-mortem adult brain tissue. However, many brain-trait associated loci have unknown cellular or molecular function. These genetic variants may exert context-specific function on different molecular phenotypes including post-transcriptional changes. Here, we identified genetic regulation of RNA-editing and alternative polyadenylation (APA), within a cell-type-specific population of human neural progenitors and neurons. More RNA-editing and isoforms utilizing longer polyadenylation sequences were observed in neurons, likely due to higher expression of genes encoding the proteins mediating these post-transcriptional events. We also detected hundreds of cell-type-specific editing quantitative trait loci (edQTLs) and alternative polyadenylation QTLs (apaQTLs). We found colocalizations of a neuron edQTL in CCDC88A with educational attainment and a progenitor apaQTL in EP300 with schizophrenia, suggesting genetically mediated post-transcriptional regulation during brain development lead to differences in brain function.
Collapse
Affiliation(s)
- Nil Aygün
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Oleh Krupa
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jessica Mory
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Brandon Le
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jordan Valone
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Dan Liang
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Michael I. Love
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jason L. Stein
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- UNC Neuroscience Center University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Lead contact
| |
Collapse
|
16
|
Li X, Wang Y, Min Q, Zhang W, Teng H, Li C, Zhang K, Shi L, Wang B, Zhan Q. Comparative transcriptome characterization of esophageal squamous cell carcinoma and adenocarcinoma. Comput Struct Biotechnol J 2023; 21:3841-3853. [PMID: 37564101 PMCID: PMC10410469 DOI: 10.1016/j.csbj.2023.07.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 07/20/2023] [Accepted: 07/22/2023] [Indexed: 08/12/2023] Open
Abstract
Background Esophageal cancers are primarily categorized as esophageal squamous cell carcinoma (ESCC) and esophageal adenocarcinoma (EAC). While various (epi) genomic alterations associated with tumor development in ESCC and EAC have been documented, a comprehensive comparison of the transcriptomes in these two cancer subtypes remains lacking. Methods We collected 551 gene expression profiles from publicly available sources, including normal, ESCC, and EAC tissues or cell lines. Subsequently, we conducted a systematic analysis to compare the transcriptomes of these samples at various levels, including gene expression, promoter activity, alternative splicing (AS), alternative polyadenylation (APA), and gene fusion. Results Seven distinct cluster gene expression patterns were identified among the differentially expressed genes in normal, ESCC, and EAC tissues. These patterns were enriched in the PI3K-Akt signaling pathway and the activation of extracellular matrix organization and exhibited repression of epidermal development. Notably, we observed additional genes or unique expression levels enriched in these shared pathways and biological processes related to tumor development and immune activation. In addition to the differentially expressed genes, there was an enrichment of lncRNA co-expression networks and downregulation of promoter activity associated with the repression of epidermal development in both ESCC and EAC. This indicates a common feature between these two cancer subtypes. Furthermore, differential AS and APA patterns in ESCC and EAC appear to partially affect the expression of host genes associated with bacterial or viral infections in these subtypes. No gene fusions were observed between ESCC and EAC, thus highlighting the distinct molecular mechanisms underlying these two cancer subtypes. Conclusions We conducted a comprehensive comparison of ESCC and EAC transcriptomes and uncovered shared and distinct transcriptomic signatures at multiple levels. These findings suggest that ESCC and EAC may exhibit common and unique mechanisms involved in tumorigenesis.
Collapse
Affiliation(s)
- Xianfeng Li
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
- Department of Gastroenterology and Chongqing Key Laboratory of Digestive Malignancies, Daping Hospital, Army Medical University (Third Military Medical University), 10# Changjiang Branch Road, Yuzhong District, Chongqing 400042, People's Republic of China
- Institute of Pathology and Southwest Cancer Center, Key Laboratory of Tumor Immunopathology of Ministry of Education of China, Southwest Hospital, Army Medical University (Third Military Medical University), Chongqing 400038, People's Republic of China
- Jinfeng Laboratory, Chongqing 401329, People's Republic of China
| | - Yan Wang
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Qingjie Min
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Weimin Zhang
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Huajing Teng
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Chao Li
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou 325035, China
| | - Kun Zhang
- Institute of Genomic Medicine, Wenzhou Medical University, Wenzhou 325035, China
| | - Leisheng Shi
- Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Bin Wang
- Department of Gastroenterology and Chongqing Key Laboratory of Digestive Malignancies, Daping Hospital, Army Medical University (Third Military Medical University), 10# Changjiang Branch Road, Yuzhong District, Chongqing 400042, People's Republic of China
- Institute of Pathology and Southwest Cancer Center, Key Laboratory of Tumor Immunopathology of Ministry of Education of China, Southwest Hospital, Army Medical University (Third Military Medical University), Chongqing 400038, People's Republic of China
- Jinfeng Laboratory, Chongqing 401329, People's Republic of China
| | - Qimin Zhan
- Key laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Laboratory of Molecular Oncology, Peking University Cancer Hospital & Institute, Beijing 100142, China
| |
Collapse
|
17
|
Bryce-Smith S, Burri D, Gazzara MR, Herrmann CJ, Danecka W, Fitzsimmons CM, Wan YK, Zhuang F, Fansler MM, Fernández JM, Ferret M, Gonzalez-Uriarte A, Haynes S, Herdman C, Kanitz A, Katsantoni M, Marini F, McDonnel E, Nicolet B, Poon CL, Rot G, Schärfen L, Wu PJ, Yoon Y, Barash Y, Zavolan M. Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.23.546284. [PMID: 37425672 PMCID: PMC10327023 DOI: 10.1101/2023.06.23.546284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, and limitations and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3'-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for seamless extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies. Furthermore, the containers and reproducible workflows generated in the course of this project can be seamlessly deployed and extended in the future to evaluate new methods or datasets.
Collapse
Affiliation(s)
- Sam Bryce-Smith
- UCL Queen Square Motor Neuron Disease Centre, Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, UCL, London, UK
| | - Dominik Burri
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Matthew R. Gazzara
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
| | - Christina J. Herrmann
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Weronika Danecka
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
| | - Christina M. Fitzsimmons
- Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Yuk Kei Wan
- Genome Institute of Singapore, Buona Vista, Singapore
- National University of Singapore, Kent Ridge, Singapore
| | - Farica Zhuang
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, USA
| | - Mervin M. Fansler
- Tri-Institutional Program in Computational Biology and Medicine, Weill Cornell GraduateStudies, New York, NY, USA
- Cancer Biology and Genetics, Sloan-Kettering Institute, MSKCC, New York, NY, USA
| | - José M. Fernández
- Barcelona Supercomputing Center, Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES)
| | - Meritxell Ferret
- Barcelona Supercomputing Center, Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES)
| | - Asier Gonzalez-Uriarte
- Barcelona Supercomputing Center, Barcelona, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES)
| | - Samuel Haynes
- Institute for Cell Biology, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom
| | | | - Alexander Kanitz
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Maria Katsantoni
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Federico Marini
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI) - UniversityMedical Center of the Johannes Gutenberg, University Mainz, Germany
| | - Euan McDonnel
- Leeds Institute for Data Analytics, School of Molecular and Cellular Biology, University of Leeds, United Kingdom
| | - Ben Nicolet
- Department of Hematopoiesis, Sanquin Research, Landsteiner Laboratory, AmsterdamUMC, University of Amsterdam, and Oncode Institute, Amsterdam, The Netherlands
| | | | - Gregor Rot
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute of Molecular Life Sciences, Zurich, Switzerland
| | - Leonard Schärfen
- Department of Molecular Biophysics & Biochemistry, Yale University, New Haven CT, USA
| | - Pin-Jou Wu
- Center for Plant Molecular Biology (ZMBP), University of Tübingen, Germany
| | - Yoseop Yoon
- Department of Microbiology and Molecular Genetics, School of Medicine, University of California Irvine, Irvine, California, USA
| | - Yoseph Barash
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
- Department of Computer and Information Science, School of Engineering, University of Pennsylvania, Philadelphia, USA
| | - Mihaela Zavolan
- Biozentrum, University of Basel, Basel, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
18
|
Vlasenok M, Margasyuk S, Pervouchine DD. Transcriptome sequencing suggests that pre-mRNA splicing counteracts widespread intronic cleavage and polyadenylation. NAR Genom Bioinform 2023; 5:lqad051. [PMID: 37260513 PMCID: PMC10227441 DOI: 10.1093/nargab/lqad051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Revised: 05/09/2023] [Accepted: 05/17/2023] [Indexed: 06/02/2023] Open
Abstract
Alternative splicing (AS) and alternative polyadenylation (APA) are two crucial steps in the post-transcriptional regulation of eukaryotic gene expression. Protocols capturing and sequencing RNA 3'-ends have uncovered widespread intronic polyadenylation (IPA) in normal and disease conditions, where it is currently attributed to stochastic variations in the pre-mRNA processing. Here, we took advantage of the massive amount of RNA-seq data generated by the Genotype Tissue Expression project (GTEx) to simultaneously identify and match tissue-specific expression of intronic polyadenylation sites with tissue-specific splicing. A combination of computational methods including the analysis of short reads with non-templated adenines revealed that APA events are more abundant in introns than in exons. While the rate of IPA in composite terminal exons and skipped terminal exons expectedly correlates with splicing, we observed a considerable fraction of IPA events that lack AS support and attributed them to spliced polyadenylated introns (SPI). We hypothesize that SPIs represent transient byproducts of a dynamic coupling between APA and AS, in which the spliceosome removes the intron while it is being cleaved and polyadenylated. These findings indicate that cotranscriptional pre-mRNA splicing could serve as a rescue mechanism to suppress premature transcription termination at intronic polyadenylation sites.
Collapse
Affiliation(s)
- Maria Vlasenok
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar 30, Moscow 121205, Russia
| | - Sergey Margasyuk
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar 30, Moscow 121205, Russia
| | - Dmitri D Pervouchine
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Bolshoy Bulvar 30, Moscow 121205, Russia
| |
Collapse
|
19
|
Urso SJ, Sathaseevan A, Brent Derry W, Lamitina T. Regulation of the hypertonic stress response by the 3' mRNA cleavage and polyadenylation complex. Genetics 2023; 224:iyad051. [PMID: 36972377 PMCID: PMC10490458 DOI: 10.1093/genetics/iyad051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 03/10/2023] [Accepted: 03/10/2023] [Indexed: 03/29/2023] Open
Abstract
Maintenance of osmotic homeostasis is one of the most aggressively defended homeostatic set points in physiology. One major mechanism of osmotic homeostasis involves the upregulation of proteins that catalyze the accumulation of solutes called organic osmolytes. To better understand how osmolyte accumulation proteins are regulated, we conducted a forward genetic screen in Caenorhabditis elegans for mutants with no induction of osmolyte biosynthesis gene expression (Nio mutants). The nio-3 mutant encoded a missense mutation in cpf-2/CstF64, while the nio-7 mutant encoded a missense mutation in symk-1/Symplekin. Both cpf-2 and symk-1 are nuclear components of the highly conserved 3' mRNA cleavage and polyadenylation complex. cpf-2 and symk-1 block the hypertonic induction of gpdh-1 and other osmotically induced mRNAs, suggesting they act at the transcriptional level. We generated a functional auxin-inducible degron (AID) allele for symk-1 and found that acute, post-developmental degradation in the intestine and hypodermis was sufficient to cause the Nio phenotype. symk-1 and cpf-2 exhibit genetic interactions that strongly suggest they function through alterations in 3' mRNA cleavage and/or alternative polyadenylation. Consistent with this hypothesis, we find that inhibition of several other components of the mRNA cleavage complex also cause a Nio phenotype. cpf-2 and symk-1 specifically affect the osmotic stress response since heat shock-induced upregulation of a hsp-16.2::GFP reporter is normal in these mutants. Our data suggest a model in which alternative polyadenylation of 1 or more mRNAs is essential to regulate the hypertonic stress response.
Collapse
Affiliation(s)
- Sarel J Urso
- Graduate Program in Cell Biology and Molecular Physiology, University of Pittsburgh Medical Center, Pittsburgh, PA 15261, USA
- Department of Cell Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15261, USA
| | - Anson Sathaseevan
- Developmental and Stem Cell Biology Program, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - W Brent Derry
- Developmental and Stem Cell Biology Program, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Todd Lamitina
- Graduate Program in Cell Biology and Molecular Physiology, University of Pittsburgh Medical Center, Pittsburgh, PA 15261, USA
- Department of Cell Biology, University of Pittsburgh School of Medicine, Pittsburgh, PA 15261, USA
- Division of Child Neurology, Department of Pediatrics, Children's Hospital of Pittsburgh, University of Pittsburgh Medical Center, Pittsburgh, PA 15224, USA
| |
Collapse
|
20
|
Rodrigues DC, Mufteev M, Yuki KE, Narula A, Wei W, Piekna A, Liu J, Pasceri P, Rissland OS, Wilson MD, Ellis J. Buffering of transcription rate by mRNA half-life is a conserved feature of Rett syndrome models. Nat Commun 2023; 14:1896. [PMID: 37019888 PMCID: PMC10076348 DOI: 10.1038/s41467-023-37339-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Accepted: 03/13/2023] [Indexed: 04/07/2023] Open
Abstract
Transcriptional changes in Rett syndrome (RTT) are assumed to directly correlate with steady-state mRNA levels, but limited evidence in mice suggests that changes in transcription can be compensated by post-transcriptional regulation. We measure transcription rate and mRNA half-life changes in RTT patient neurons using RATEseq, and re-interpret nuclear and whole-cell RNAseq from Mecp2 mice. Genes are dysregulated by changing transcription rate or half-life and are buffered when both change. We utilized classifier models to predict the direction of transcription rate changes and find that combined frequencies of three dinucleotides are better predictors than CA and CG. MicroRNA and RNA-binding Protein (RBP) motifs are enriched in 3'UTRs of genes with half-life changes. Nuclear RBP motifs are enriched on buffered genes with increased transcription rate. We identify post-transcriptional mechanisms in humans and mice that alter half-life or buffer transcription rate changes when a transcriptional modulator gene is mutated in a neurodevelopmental disorder.
Collapse
Affiliation(s)
- Deivid C Rodrigues
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Marat Mufteev
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
| | - Kyoko E Yuki
- Genetics & Genome Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Ashrut Narula
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Molecular Medicine, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Wei Wei
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Alina Piekna
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Jiajie Liu
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Peter Pasceri
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - Olivia S Rissland
- Molecular Medicine, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
- RNA Bioscience Initiative and Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | - Michael D Wilson
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Genetics & Genome Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada
| | - James Ellis
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, ON, M5G 0A4, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada.
| |
Collapse
|
21
|
Zhang Z, Bae B, Cuddleston WH, Miura P. Coordination of Alternative Splicing and Alternative Polyadenylation revealed by Targeted Long-Read Sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.23.533999. [PMID: 36993601 PMCID: PMC10055423 DOI: 10.1101/2023.03.23.533999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Nervous system development is associated with extensive regulation of alternative splicing (AS) and alternative polyadenylation (APA). AS and APA have been extensively studied in isolation, but little is known about how these processes are coordinated. Here, the coordination of cassette exon (CE) splicing and APA in Drosophila was investigated using a targeted long-read sequencing approach we call Pull-a-Long-Seq (PL-Seq). This cost-effective method uses cDNA pulldown and Nanopore sequencing combined with an analysis pipeline to resolve the connectivity of alternative exons to alternative 3' ends. Using PL-Seq, we identified genes that exhibit significant differences in CE splicing depending on connectivity to short versus long 3'UTRs. Genomic long 3'UTR deletion was found to alter upstream CE splicing in short 3'UTR isoforms and ELAV loss differentially affected CE splicing depending on connectivity to alternative 3'UTRs. This work highlights the importance of considering connectivity to alternative 3'UTRs when monitoring AS events.
Collapse
Affiliation(s)
- Zhiping Zhang
- Department of Biology, University of Nevada, Reno, Reno, NV, USA
- Department of Genetics and Genome Sciences, UConn Health, Farmington, CT, USA
| | - Bongmin Bae
- Department of Biology, University of Nevada, Reno, Reno, NV, USA
| | | | - Pedro Miura
- Department of Biology, University of Nevada, Reno, Reno, NV, USA
- Department of Genetics and Genome Sciences, UConn Health, Farmington, CT, USA
| |
Collapse
|
22
|
Mufteev M, Rodrigues DC, Yuki KE, Narula A, Wei W, Piekna A, Liu J, Pasceri P, Rissland OS, Wilson MD, Ellis J. Transcriptional buffering and 3'UTR lengthening are shaped during human neurodevelopment by shifts in mRNA stability and microRNA load. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.01.530249. [PMID: 36909614 PMCID: PMC10002768 DOI: 10.1101/2023.03.01.530249] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/07/2023]
Abstract
The contribution of mRNA half-life is commonly overlooked when examining changes in mRNA abundance during development. mRNA levels of some genes are regulated by transcription rate only, but others may be regulated by mRNA half-life only shifts. Furthermore, transcriptional buffering is predicted when changes in transcription rates have compensating shifts in mRNA half-life resulting in no change to steady-state levels. Likewise, transcriptional boosting should result when changes in transcription rate are accompanied by amplifying half-life shifts. During neurodevelopment there is widespread 3'UTR lengthening that could be shaped by differential shifts in the stability of existing short or long 3'UTR transcript isoforms. We measured transcription rate and mRNA half-life changes during induced human Pluripotent Stem Cell (iPSC)-derived neuronal development using RATE-seq. During transitions to progenitor and neuron stages, transcriptional buffering occurred in up to 50%, and transcriptional boosting in up to 15%, of genes with changed transcription rates. The remaining changes occurred by transcription rate only or mRNA half-life only shifts. Average mRNA half-life decreased two-fold in neurons relative to iPSCs. Short gene isoforms were more destabilized in neurons and thereby increased the average 3'UTR length. Small RNA sequencing captured an increase in microRNA copy number per cell during neurodevelopment. We propose that mRNA destabilization and 3'UTR lengthening are driven in part by an increase in microRNA load in neurons. Our findings identify mRNA stability mechanisms in human neurodevelopment that regulate gene and isoform level abundance and provide a precedent for similar post-transcriptional regulatory events as other tissues develop.
Collapse
Affiliation(s)
- Marat Mufteev
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Deivid C Rodrigues
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Kyoko E Yuki
- Genetics & Genome Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Ashrut Narula
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Wei Wei
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Alina Piekna
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Jiajie Liu
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Peter Pasceri
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - Olivia S Rissland
- Molecular Medicine, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- RNA Bioscience Initiative and Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
| | - Michael D Wilson
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Genetics & Genome Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
| | - James Ellis
- Developmental & Stem Cell Biology, Hospital for Sick Children, Toronto, Ontario M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
23
|
He PC, Wei J, Dou X, Harada BT, Zhang Z, Ge R, Liu C, Zhang LS, Yu X, Wang S, Lyu R, Zou Z, Chen M, He C. Exon architecture controls mRNA m 6A suppression and gene expression. Science 2023; 379:677-682. [PMID: 36705538 PMCID: PMC9990141 DOI: 10.1126/science.abj9090] [Citation(s) in RCA: 67] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 01/16/2023] [Indexed: 01/28/2023]
Abstract
N6-methyladenosine (m6A) is the most abundant messenger RNA (mRNA) modification and plays crucial roles in diverse physiological processes. Using a massively parallel assay for m6A (MPm6A), we discover that m6A specificity is globally regulated by suppressors that prevent m6A deposition in unmethylated transcriptome regions. We identify exon junction complexes (EJCs) as m6A suppressors that protect exon junction-proximal RNA within coding sequences from methylation and regulate mRNA stability through m6A suppression. EJC suppression of m6A underlies multiple global characteristics of mRNA m6A specificity, with the local range of EJC protection sufficient to suppress m6A deposition in average-length internal exons but not in long internal and terminal exons. EJC-suppressed methylation sites colocalize with EJC-suppressed splice sites, which suggests that exon architecture broadly determines local mRNA accessibility to regulatory complexes.
Collapse
Affiliation(s)
- P. Cody He
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Committee on Immunology, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Jiangbo Wei
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Xiaoyang Dou
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Bryan T. Harada
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Zijie Zhang
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
- State Key Laboratory for Conservation and Utilization of Bio-Resources, School of Life Sciences, Yunnan University, Kunming, Yunnan 650091, China
| | - Ruiqi Ge
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Chang Liu
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Li-Sheng Zhang
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Xianbin Yu
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Shuai Wang
- Department of Neurobiology, The University of Chicago, Chicago, IL 60637, USA
| | - Ruitu Lyu
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Zhongyu Zou
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| | - Mengjie Chen
- Department of Human Genetics, The University of Chicago, Chicago, IL 60637, USA
- Section of Genetic Medicine, Department of Medicine, The University of Chicago, Chicago, IL 60637, USA
| | - Chuan He
- Department of Chemistry, Department of Biochemistry and Molecular Biology, Institute for Biophysical Dynamics, The University of Chicago, Chicago, IL 60637, USA
- Committee on Immunology, The University of Chicago, Chicago, IL 60637, USA
- Howard Hughes Medical Institute, The University of Chicago, Chicago, IL 60637, USA
| |
Collapse
|
24
|
Imada EL, Wilks C, Langmead B, Marchionni L. REPAC: analysis of alternative polyadenylation from RNA-sequencing data. Genome Biol 2023; 24:22. [PMID: 36759904 PMCID: PMC9912678 DOI: 10.1186/s13059-023-02865-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 01/24/2023] [Indexed: 02/11/2023] Open
Abstract
Alternative polyadenylation (APA) is an important post-transcriptional mechanism that has major implications in biological processes and diseases. Although specialized sequencing methods for polyadenylation exist, availability of these data are limited compared to RNA-sequencing data. We developed REPAC, a framework for the analysis of APA from RNA-sequencing data. Using REPAC, we investigate the landscape of APA caused by activation of B cells. We also show that REPAC is faster than alternative methods by at least 7-fold and that it scales well to hundreds of samples. Overall, the REPAC method offers an accurate, easy, and convenient solution for the exploration of APA.
Collapse
Affiliation(s)
- Eddie L. Imada
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, USA
| | - Christopher Wilks
- Department of Computer Science, Johns Hopkins University, Baltimore, USA
| | - Ben Langmead
- Department of Computer Science, Johns Hopkins University, Baltimore, USA
| | - Luigi Marchionni
- Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, USA
| |
Collapse
|
25
|
Zhao T, Zhan D, Qu S, Jiang S, Gan W, Qin W, Zheng C, Cheng F, Lu Y, Liu M, Shi J, Liang H, Wang Y, Qin J, Zen K, Liu Z. Transcriptomics-proteomics Integration reveals alternative polyadenylation driving inflammation-related protein translation in patients with diabetic nephropathy. J Transl Med 2023; 21:86. [PMID: 36747266 PMCID: PMC9900993 DOI: 10.1186/s12967-023-03934-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 01/26/2023] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Diabetic nephropathy (DN) is a complex disease involving the upregulation of many inflammation-related proteins. Alternative polyadenylation (APA), a crucial post-transcriptional regulatory mechanism, has been proven to play vital roles in many inflammatory diseases. However, it is largely unknown whether and how APA exerts function in DN. METHODS We performed transcriptomics and proteomics analysis of glomeruli samples isolated from 50 biopsy-proven DN patients and 25 control subjects. DaPars and QAPA algorithms were adopted to identify APA events from RNA-seq data. The qRT-PCR analysis was conducted to verify 3'UTR length alteration. Short and long 3'UTRs isoforms were also overexpressed in podocytes under hyperglycemia condition for examining protein expression. RESULTS We detected transcriptome-wide 3'UTR APA events in DN, and found that APA-mediated 3'UTR lengthening of genes (APA genes) increased their expression at protein but not mRNA level. Increased protein level of 3'UTR lengthening gene was validated in podocytes under hyperglycemia condition. Pathway enrichment analysis showed that APA genes were enriched in inflammation-related biological processes including endoplasmic reticulum stress pathways, NF-κB signaling and autophagy. Further bioinformatics analysis demonstrated that 3'UTR APA of genes probably altered the binding sites for RNA-binding proteins, thus enhancing protein translation. CONCLUSION This study revealed for the first time that 3'UTR lengthening of APA genes contributed to the progression of DN by elevating the translation of corresponding proteins, providing new insight and a rich resource for investigating DN mechanisms.
Collapse
Affiliation(s)
- Tingting Zhao
- grid.440259.e0000 0001 0115 7868National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002 Jiangsu China
| | - Dongdong Zhan
- grid.419611.a0000 0004 0457 9072State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206 China
| | - Shuang Qu
- grid.41156.370000 0001 2314 964XState Key Laboratory of Pharmaceutical Biotechnology, Nanjing University School of Life Sciences, Nanjing, 210093 Jiangsu China
| | - Song Jiang
- grid.440259.e0000 0001 0115 7868National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002 Jiangsu China
| | - Wenhua Gan
- grid.41156.370000 0001 2314 964XState Key Laboratory of Pharmaceutical Biotechnology, Nanjing University School of Life Sciences, Nanjing, 210093 Jiangsu China
| | - Weisong Qin
- grid.440259.e0000 0001 0115 7868National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002 Jiangsu China
| | - Chunxia Zheng
- grid.440259.e0000 0001 0115 7868National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002 Jiangsu China
| | - Fang Cheng
- grid.419611.a0000 0004 0457 9072State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206 China
| | - Yinghui Lu
- grid.440259.e0000 0001 0115 7868National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002 Jiangsu China
| | - Mingwei Liu
- grid.419611.a0000 0004 0457 9072State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206 China
| | - Jinsong Shi
- grid.440259.e0000 0001 0115 7868National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002 Jiangsu China
| | - Hongwei Liang
- grid.41156.370000 0001 2314 964XState Key Laboratory of Pharmaceutical Biotechnology, Nanjing University School of Life Sciences, Nanjing, 210093 Jiangsu China
| | - Yi Wang
- grid.419611.a0000 0004 0457 9072State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206 China
| | - Jun Qin
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China.
| | - Ke Zen
- State Key Laboratory of Pharmaceutical Biotechnology, Nanjing University School of Life Sciences, Nanjing, 210093, Jiangsu, China.
| | - Zhihong Liu
- National Clinical Research Center of Kidney Diseases, Jinling Hospital, Nanjing University School of Medicine, Nanjing, 210002, Jiangsu, China.
| |
Collapse
|
26
|
Wen H, Chen W, Chen Y, Wei G, Ni T. Integrative analysis of Iso-Seq and RNA-seq reveals dynamic changes of alternative promoter, alternative splicing and alternative polyadenylation during Angiotensin II-induced senescence in rat primary aortic endothelial cells. Front Genet 2023; 14:1064624. [PMID: 36741323 PMCID: PMC9892061 DOI: 10.3389/fgene.2023.1064624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 01/10/2023] [Indexed: 01/21/2023] Open
Abstract
In eukaryotes, alternative promoter (AP), alternative splicing (AS), and alternative polyadenylation (APA) are three crucial regulatory mechanisms that modulate message RNA (mRNA) diversity. Although AP, AS and APA are involved in diverse biological processess, whether they have dynamic changes in Angiotensin II (Ang II) induced senescence in rat primary aortic endothelial cells (RAECs), an important cellular model for studying cardiovascular disease, remains unclear. Here we integrated both PacBio single-molecule long-read isoform sequencing (Iso-Seq) and Illumina short-read RNA sequencing (RNA-seq) to analyze the changes of AP, AS and APA in Ang II-induced senescent RAECs. Iso-Seq generated 36,278 isoforms from 10,145 gene loci and 65.81% of these isoforms are novel, which were further cross-validated by public data obtained by other techonologies such as CAGE, PolyA-Seq and 3'READS. APA contributed most to novel isoforms, followed by AS and AP. Further investigation showed that AP, AS and APA could all contribute to the regulation of isoform, but AS has more dynamic changes compared to AP and APA upon Ang II stimulation. Genes undergoing AP, AS and APA in Ang II-treated cells are enriched in various pathways related to aging or senescence, suggesting that these molecular changes are involved in functional alterations during Ang II-induced senescence. Together, the present study largely improved the annotation of rat genome and revealed gene expression changes at isoform level, extending the understanding of the complexity of gene regulation in Ang II-treated RAECs, and also provided novel clues for discovering the regulatory mechanism undelying Ang II caused vascular senescence and diseases.
Collapse
Affiliation(s)
- Haimei Wen
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China,Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
| | - Wei Chen
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Yu Chen
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China
| | - Gang Wei
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China,Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China,*Correspondence: Ting Ni, ; Gang Wei,
| | - Ting Ni
- Collaborative Innovation Center of Genetics and Development, School of Life Sciences, Human Phenome Institute, Fudan University, Shanghai, China,*Correspondence: Ting Ni, ; Gang Wei,
| |
Collapse
|
27
|
Chan K, Farias AG, Lee H, Guvenc F, Mero P, Brown KR, Ward H, Billmann M, Aulakh K, Astori A, Haider S, Marcon E, Braunschweig U, Pu S, Habsid A, Yan Tong AH, Christie-Holmes N, Budylowski P, Ghalami A, Mubareka S, Maguire F, Banerjee A, Mossman KL, Greenblatt J, Gray-Owen SD, Raught B, Blencowe BJ, Taipale M, Myers C, Moffat J. Survival-based CRISPR genetic screens across a panel of permissive cell lines identify common and cell-specific SARS-CoV-2 host factors. Heliyon 2023; 9:e12744. [PMID: 36597481 PMCID: PMC9800021 DOI: 10.1016/j.heliyon.2022.e12744] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 12/15/2022] [Accepted: 12/22/2022] [Indexed: 12/31/2022] Open
Abstract
SARS-CoV-2 depends on host cell components for infection and replication. Identification of virus-host dependencies offers an effective way to elucidate mechanisms involved in viral infection and replication. If druggable, host factor dependencies may present an attractive strategy for anti-viral therapy. In this study, we performed genome wide CRISPR knockout screens in Vero E6 cells and four human cell lines including Calu-3, UM-UC-4, HEK-293 and HuH-7 to identify genetic regulators of SARS-CoV-2 infection. Our findings identified only ACE2, the cognate SARS-CoV-2 entry receptor, as a common host dependency factor across all cell lines, while other host genes identified were largely cell line specific, including known factors TMPRSS2 and CTSL. Several of the discovered host-dependency factors converged on pathways involved in cell signalling, immune-related pathways, and chromatin modification. Notably, the chromatin modifier gene KMT2C in Calu-3 cells had the strongest impact in preventing SARS-CoV-2 infection when perturbed.
Collapse
Affiliation(s)
- Katherine Chan
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1,Corresponding author
| | - Adrian Granda Farias
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1,Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Hunsang Lee
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Furkan Guvenc
- Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Patricia Mero
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Kevin R. Brown
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Henry Ward
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, Minneapolis, MN, USA
| | - Maximilian Billmann
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, Minneapolis, MN, USA
| | - Kamaldeep Aulakh
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Audrey Astori
- Princess Margaret Cancer Center, Toronto, Ontario, Canada
| | - Shahan Haider
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Edyta Marcon
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Ulrich Braunschweig
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Shuye Pu
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Andrea Habsid
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Amy Hin Yan Tong
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| | - Natasha Christie-Holmes
- Combined Containment Level 3 Unit, Temerty Faculty of Medicine, University of Toronto Toronto, Ontario, Canada, M5S3E1
| | - Patrick Budylowski
- Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Ayoob Ghalami
- Office of Environmental Health & Safety, University of Toronto, Toronto, Ontario, Canada
| | - Samira Mubareka
- Sunnybrook Research Institute, Toronto, Ontario, Canada, M5S3E1,Department of Laboratory Medicine and Pathobiology, University of Toronto, Ontario, Canada
| | - Finlay Maguire
- Department of Community Health and Epidemiology, Faculty of Medicine Dalhousie University, Halifax, Nova Scotia, Canada,Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Arinjay Banerjee
- Vaccine and Infectious Disease Organization, Department of Veterinary Microbiology, University of Saskatchewan, Saskatoon, Saskatchewan, Canada
| | - Karen L. Mossman
- Department of Medicine, McMaster University, Hamilton, Ontario, Canada
| | - Jack Greenblatt
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1,Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Scott D. Gray-Owen
- Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Brian Raught
- Princess Margaret Cancer Center, Toronto, Ontario, Canada
| | - Benjamin J. Blencowe
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1,Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Mikko Taipale
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1,Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8
| | - Chad Myers
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, Minneapolis, MN, USA
| | - Jason Moffat
- Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1,Department of Molecular Genetics, 1 King's College Circle, University of Toronto, Toronto, Ontario, Canada, M5S1A8,Institute for Biomedical Engineering, Rosebrugh Building, 164 College Street, Room 407, University of Toronto, Toronto, Ontario, Canada, M5S3G9,Corresponding author. Donnelly Center, 160 College Street, University of Toronto, Toronto, Ontario, Canada, M5S3E1
| |
Collapse
|
28
|
Comprehensive mapping of alternative polyadenylation site usage and its dynamics at single-cell resolution. Proc Natl Acad Sci U S A 2022; 119:e2113504119. [PMID: 36454750 PMCID: PMC9894249 DOI: 10.1073/pnas.2113504119] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Alternative polyadenylation (APA) plays an important role in posttranscriptional gene regulation such as transcript stability and translation efficiency. However, our knowledge about APA dynamics at the single-cell level is largely unexplored. Here, we developed single-cell polyadenylation sequencing, a strand-specific approach for sequencing the 3' end of transcripts, to investigate the landscape of APA at the single-cell level. By analyzing several cell lines, we found many genes using multiple polyA sites in bulk data are prone to use only one polyA site in each single cell. Interestingly, cell cycle genes were significantly enriched in genes with high variation in polyA site usages. Furthermore, the 414 genes showing a polyA site usage switch after cell synchronization enriched cell cycle genes, while the differentially expressed genes after cell synchronization did not enrich cell cycle genes. We further identified 812 genes showing polyA site usage changes between neighboring cell cycles, which were grouped into six clusters, with cell phase-specific functional categories enriched in each cluster. Deletion of one polyA site in MSL1 and SCCPDH results in slower and faster cell cycle progression, respectively, supporting polyA site usage switch played an important role in cell cycle. These results indicate that APA is an important layer for cell cycle regulation.
Collapse
|
29
|
Kang W, Yang Y, Chen C, Yu C. CPSF1 positively regulates NSDHL by alternative polyadenylation and promotes gastric cancer progression. Am J Cancer Res 2022; 12:4566-4583. [PMID: 36381317 PMCID: PMC9641403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 09/29/2022] [Indexed: 06/16/2023] Open
Abstract
Gastric cancer (GC) is a common malignancies with unfavourable prognosis. As one of the most common RNA modifications in nature, alternative polyadenylation (APA) plays a critical role in the progression of carcinomas. CPSF1 is a critical APA-related factor and is involved in many cancers. Nevertheless, the roles and underlying mechanisms of CPSF1 remain unclear in GC. In this work, we identified that CPSF1 is significantly upregulated in GC and that high CPSF1 expression indicates an unfavourable prognosis in GC patients. Moreover, CPSF1 expression levels were closely associated with tumour size, TNM stage and lymph node metastasis. CPSF1 depletion dramatically weakened GC cell proliferation and metastasis. We then performed RNA sequencing and found numerous downstream genes involved the regulation of CPSF1 with remarkable changes in 3'UTR length, among which NSDHL was positively regulated by CPSF1 and promoted GC progression. In addition, rescue assays demonstrated that NSDHL mediated the carcinogenic effect of CPSF1, and this process potentially involved APA. Therefore, this study showed that CPSF1 promotes GC progression, at least in part, by enhancing NSDHL and offered new insights into therapeutic targets for GC.
Collapse
|
30
|
Witkowski MT, Lee S, Wang E, Lee AK, Talbot A, Ma C, Tsopoulidis N, Brumbaugh J, Zhao Y, Roberts KG, Hogg SJ, Nomikou S, Ghebrechristos YE, Thandapani P, Mullighan CG, Hochedlinger K, Chen W, Abdel-Wahab O, Eyquem J, Aifantis I. NUDT21 limits CD19 levels through alternative mRNA polyadenylation in B cell acute lymphoblastic leukemia. Nat Immunol 2022; 23:1424-1432. [PMID: 36138187 PMCID: PMC9611506 DOI: 10.1038/s41590-022-01314-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 08/11/2022] [Indexed: 02/04/2023]
Abstract
B cell progenitor acute lymphoblastic leukemia (B-ALL) treatment has been revolutionized by T cell-based immunotherapies-including chimeric antigen receptor T cell therapy (CAR-T) and the bispecific T cell engager therapeutic, blinatumomab-targeting surface glycoprotein CD19. Unfortunately, many patients with B-ALL will fail immunotherapy due to 'antigen escape'-the loss or absence of leukemic CD19 targeted by anti-leukemic T cells. In the present study, we utilized a genome-wide CRISPR-Cas9 screening approach to identify modulators of CD19 abundance on human B-ALL blasts. These studies identified a critical role for the transcriptional activator ZNF143 in CD19 promoter activation. Conversely, the RNA-binding protein, NUDT21, limited expression of CD19 by regulating CD19 messenger RNA polyadenylation and stability. NUDT21 deletion in B-ALL cells increased the expression of CD19 and the sensitivity to CD19-specific CAR-T and blinatumomab. In human B-ALL patients treated with CAR-T and blinatumomab, upregulation of NUDT21 mRNA coincided with CD19 loss at disease relapse. Together, these studies identify new CD19 modulators in human B-ALL.
Collapse
Affiliation(s)
- Matthew T Witkowski
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA.
- Department of Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, USA.
| | - Soobeom Lee
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA
- Department of Biology, New York University (NYU), New York, NY, USA
| | - Eric Wang
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- The Jackson Laboratory for Genomic Medicine, Farmington, USA
| | - Anna K Lee
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA
| | - Alexis Talbot
- Department of Medicine, University of California San Francisco, San Francisco, CA, USA
| | - Chao Ma
- Department of Mechanical and Aerospace Engineering, New York University, New York, NY, USA
- Department of Biomedical Engineering, New York University, New York, NY, USA
| | - Nikolaos Tsopoulidis
- Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
- Cancer Center and Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Justin Brumbaugh
- Department of Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, CO, USA
| | - Yaqi Zhao
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Kathryn G Roberts
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Simon J Hogg
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Sofia Nomikou
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA
| | - Yohana E Ghebrechristos
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA
| | - Palaniraja Thandapani
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA
| | - Charles G Mullighan
- Department of Pathology, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Konrad Hochedlinger
- Department of Molecular Biology, Massachusetts General Hospital, Boston, MA, USA
- Cancer Center and Center for Regenerative Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Weiqiang Chen
- Department of Mechanical and Aerospace Engineering, New York University, New York, NY, USA
- Department of Biomedical Engineering, New York University, New York, NY, USA
| | - Omar Abdel-Wahab
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Justin Eyquem
- Department of Medicine, University of California San Francisco, San Francisco, CA, USA
- Gladstone-UCSF Institute of Genomic Immunology, San Francisco, CA, USA
- Department of Microbiology and Immunology, University of California San Francisco, San Francisco, CA, USA
- Parker Institute of Cancer Immunotherapy, University of California San Francisco, San Francisco, CA, USA
| | - Iannis Aifantis
- Department of Pathology and Laura & Isaac Perlmutter Cancer Center, NYU School of Medicine, New York, NY, USA.
| |
Collapse
|
31
|
Ye W, Lian Q, Ye C, Wu X. A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022:S1672-0229(22)00121-8. [PMID: 36167284 PMCID: PMC10372920 DOI: 10.1016/j.gpb.2022.09.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 08/17/2022] [Accepted: 09/19/2022] [Indexed: 05/08/2023]
Abstract
Alternative polyadenylation (APA) plays important roles in modulating mRNA stability, translation, and subcellular localization, and contributes extensively to shaping eukaryotic transcriptome complexity and proteome diversity. Identification of poly(A) sites (pAs) on a genome-wide scale is a critical step toward understanding the underlying mechanism of APA-mediated gene regulation. A number of established computational tools have been proposed to predict pAs from diverse genomic data. Here we provided an exhaustive overview of computational approaches for predicting pAs from DNA sequences, bulk RNA sequencing (RNA-seq) data, and single-cell RNA sequencing (scRNA-seq) data. Particularly, we examined several representative tools using bulk RNA-seq and scRNA-seq data from peripheral blood mononuclear cells and put forward operable suggestions on how to assess the reliability of pAs predicted by different tools. We also proposed practical guidelines on choosing appropriate methods applicable to diverse scenarios. Moreover, we discussed in depth the challenges in improving the performance of pA prediction and benchmarking different methods. Additionally, we highlighted outstanding challenges and opportunities using new machine learning and integrative multi-omics techniques, and provided our perspective on how computational methodologies might evolve in the future for non-3' untranslated region, tissue-specific, cross-species, and single-cell pA prediction.
Collapse
Affiliation(s)
- Wenbin Ye
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China
| | - Qiwei Lian
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China; Department of Automation, Xiamen University, Xiamen 361005, China
| | - Congting Ye
- Key Laboratory of the Coastal and Wetland Ecosystems, Ministry of Education, College of the Environment and Ecology, Xiamen University, Xiamen 361005, China
| | - Xiaohui Wu
- Pasteurien College, Suzhou Medical College of Soochow University, Soochow University, Suzhou 215000, China.
| |
Collapse
|
32
|
Lee S, Chen YC, Gillen AE, Taliaferro JM, Deplancke B, Li H, Lai EC. Diverse cell-specific patterns of alternative polyadenylation in Drosophila. Nat Commun 2022; 13:5372. [PMID: 36100597 PMCID: PMC9470587 DOI: 10.1038/s41467-022-32305-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 07/24/2022] [Indexed: 11/17/2022] Open
Abstract
Most genes in higher eukaryotes express isoforms with distinct 3' untranslated regions (3' UTRs), generated by alternative polyadenylation (APA). Since 3' UTRs are predominant locations of post-transcriptional regulation, APA can render such programs conditional, and can also alter protein sequences via alternative last exon (ALE) isoforms. We previously used 3'-sequencing from diverse Drosophila samples to define multiple tissue-specific APA landscapes. Here, we exploit comprehensive single nucleus RNA-sequencing data (Fly Cell Atlas) to elucidate cell-type expression of 3' UTRs across >250 adult Drosophila cell types. We reveal the cellular bases of multiple tissue-specific APA/ALE programs, such as 3' UTR lengthening in differentiated neurons and 3' UTR shortening in spermatocytes and spermatids. We trace dynamic 3' UTR patterns across cell lineages, including in the male germline, and discover new APA patterns in the intestinal stem cell lineage. Finally, we correlate expression of RNA binding proteins (RBPs), miRNAs and global levels of cleavage and polyadenylation (CPA) factors in several cell types that exhibit characteristic APA landscapes, yielding candidate regulators of transcriptome complexity. These analyses provide a comprehensive foundation for future investigations of mechanisms and biological impacts of alternative 3' isoforms across the major cell types of this widely-studied model organism.
Collapse
Affiliation(s)
- Seungjae Lee
- Developmental Biology Program, Sloan Kettering Institute, 1275 York Ave, Box 252, New York, NY, 10065, USA
| | - Yen-Chung Chen
- Department of Biology, New York University, New York, NY, 10013, USA
| | | | - Austin E Gillen
- Division of Hematology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.,Rocky Mountain Regional VA Medical Center, Aurora, CO, USA.,RNA Bioscience Initiative, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - J Matthew Taliaferro
- RNA Bioscience Initiative, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.,Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Bart Deplancke
- Laboratory of Systems Biology and Genetics, Institute of Bio-engineering & Global Health Institute, School of Life Sciences, EPFL, CH-1015, Lausanne, Switzerland
| | - Hongjie Li
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX, 77030, USA.,Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Eric C Lai
- Developmental Biology Program, Sloan Kettering Institute, 1275 York Ave, Box 252, New York, NY, 10065, USA.
| |
Collapse
|
33
|
Hu X, Song J, Chyr J, Wan J, Wang X, Du J, Duan J, Zhang H, Zhou X, Wu X. APAview: A web-based platform for alternative polyadenylation analyses in hematological cancers. Front Genet 2022; 13:928862. [PMID: 36035147 PMCID: PMC9411867 DOI: 10.3389/fgene.2022.928862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 07/06/2022] [Indexed: 11/30/2022] Open
Abstract
Background: Hematologic malignancies, such as acute promyelocytic leukemia (APL) and acute myeloid leukemia (AML), are cancers that start in blood-forming tissues and can affect the blood, bone marrow, and lymph nodes. They are often caused by genetic and molecular alterations such as mutations and gene expression changes. Alternative polyadenylation (APA) is a post-transcriptional process that regulates gene expression, and dysregulation of APA contributes to hematological malignancies. RNA-sequencing-based bioinformatic methods can identify APA sites and quantify APA usages as molecular indexes to study APA roles in disease development, diagnosis, and treatment. Unfortunately, APA data pre-processing, analysis, and visualization are time-consuming, inconsistent, and laborious. A comprehensive, user-friendly tool will greatly simplify processes for APA feature screening and mining. Results: Here, we present APAview, a web-based platform to explore APA features in hematological cancers and perform APA statistical analysis. APAview server runs on Python3 with a Flask framework and a Jinja2 templating engine. For visualization, APAview client is built on Bootstrap and Plotly. Multimodal data, such as APA quantified by QAPA/DaPars, gene expression data, and clinical information, can be uploaded to APAview and analyzed interactively. Correlation, survival, and differential analyses among user-defined groups can be performed via the web interface. Using APAview, we explored APA features in two hematological cancers, APL and AML. APAview can also be applied to other diseases by uploading different experimental data.
Collapse
Affiliation(s)
- Xi Hu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Jialin Song
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Jacqueline Chyr
- Center for Computational Systems Medicine, School of Biomedical Informatics, University of Texas Health Science Center, Houston, TX, United States
| | - Jinping Wan
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Xiaoyan Wang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Jianqiang Du
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Junbo Duan
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Huqin Zhang
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
| | - Xiaobo Zhou
- Center for Computational Systems Medicine, School of Biomedical Informatics, University of Texas Health Science Center, Houston, TX, United States
| | - Xiaoming Wu
- The Key Laboratory of Biomedical Information Engineering of Ministry of Education, School of Life Science and Technology, Xi’an Jiaotong University, Xi’an, China
- *Correspondence: Xiaoming Wu,
| |
Collapse
|
34
|
Song J, Nabeel-Shah S, Pu S, Lee H, Braunschweig U, Ni Z, Ahmed N, Marcon E, Zhong G, Ray D, Ha KCH, Guo X, Zhang Z, Hughes TR, Blencowe BJ, Greenblatt JF. Regulation of alternative polyadenylation by the C2H2-zinc-finger protein Sp1. Mol Cell 2022; 82:3135-3150.e9. [PMID: 35914531 DOI: 10.1016/j.molcel.2022.06.031] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 04/09/2022] [Accepted: 06/28/2022] [Indexed: 11/18/2022]
Abstract
Alternative polyadenylation (APA) enhances gene regulatory potential by increasing the diversity of mRNA transcripts. 3' UTR shortening through APA correlates with enhanced cellular proliferation and is a widespread phenomenon in tumor cells. Here, we show that the ubiquitously expressed transcription factor Sp1 binds RNA in vivo and is a common repressor of distal poly(A) site usage. RNA sequencing identified 2,344 genes (36% of the total mapped mRNA transcripts) with lengthened 3' UTRs upon Sp1 depletion. Sp1 preferentially binds the 3' UTRs of such lengthened transcripts and inhibits cleavage at distal sites by interacting with the subunits of the core cleavage and polyadenylation (CPA) machinery. The 3' UTR lengths of Sp1 target genes in breast cancer patient RNA-seq data correlate with Sp1 expression levels, implicating Sp1-mediated APA regulation in modulating tumorigenic properties. Taken together, our findings provide insights into the mechanism for dynamic APA regulation by unraveling a previously unknown function of the DNA-binding transcription factor Sp1.
Collapse
Affiliation(s)
- Jingwen Song
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Syed Nabeel-Shah
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Shuye Pu
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Hyunmin Lee
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Computer Science, University of Toronto, 10 King's College Road, Toronto, ON M5S 3G4, Canada
| | - Ulrich Braunschweig
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Zuyao Ni
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Nujhat Ahmed
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Edyta Marcon
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Guoqing Zhong
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Debashish Ray
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Kevin C H Ha
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Xinghua Guo
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada
| | - Zhaolei Zhang
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada; Department of Computer Science, University of Toronto, 10 King's College Road, Toronto, ON M5S 3G4, Canada
| | - Timothy R Hughes
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Benjamin J Blencowe
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada
| | - Jack F Greenblatt
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 1 King's College Circle, Toronto, ON M5A 1A8, Canada.
| |
Collapse
|
35
|
Wheeler JR, Whitney ON, Vogler TO, Nguyen ED, Pawlikowski B, Lester E, Cutler A, Elston T, Dalla Betta N, Parker KR, Yost KE, Vogel H, Rando TA, Chang HY, Johnson AM, Parker R, Olwin BB. RNA-binding proteins direct myogenic cell fate decisions. eLife 2022; 11:e75844. [PMID: 35695839 PMCID: PMC9191894 DOI: 10.7554/elife.75844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 05/20/2022] [Indexed: 11/13/2022] Open
Abstract
RNA-binding proteins (RBPs), essential for skeletal muscle regeneration, cause muscle degeneration and neuromuscular disease when mutated. Why mutations in these ubiquitously expressed RBPs orchestrate complex tissue regeneration and direct cell fate decisions in skeletal muscle remains poorly understood. Single-cell RNA-sequencing of regenerating Mus musculus skeletal muscle reveals that RBP expression, including the expression of many neuromuscular disease-associated RBPs, is temporally regulated in skeletal muscle stem cells and correlates with specific stages of myogenic differentiation. By combining machine learning with RBP engagement scoring, we discovered that the neuromuscular disease-associated RBP Hnrnpa2b1 is a differentiation-specifying regulator of myogenesis that controls myogenic cell fate transitions during terminal differentiation in mice. The timing of RBP expression specifies cell fate transitions by providing post-transcriptional regulation of messenger RNAs that coordinate stem cell fate decisions during tissue regeneration.
Collapse
Affiliation(s)
- Joshua R Wheeler
- Department of Biochemistry, University of ColoradoBoulderUnited States
- Medical Scientist Training Program, University of Colorado Anschutz Medical CampusAuroraUnited States
- Howard Hughes Medical Institute, University of ColoradoBoulderUnited States
- Department of Pathology, Stanford UniversityStanfordUnited States
- Department of Neuropathology, Stanford UniversityStanfordUnited States
| | - Oscar N Whitney
- Department of Molecular and Cell Biology, University of California, BerkeleyBerkeleyUnited States
| | - Thomas O Vogler
- Medical Scientist Training Program, University of Colorado Anschutz Medical CampusAuroraUnited States
- Department of Molecular, Cellular and Developmental Biology, University of ColoradoBoulderUnited States
- Department of Surgery, University of ColoradoAuroraUnited States
| | - Eric D Nguyen
- Medical Scientist Training Program, University of Colorado Anschutz Medical CampusAuroraUnited States
- Molecular Biology Program and Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical CampusAuroraUnited States
| | - Bradley Pawlikowski
- Department of Molecular, Cellular and Developmental Biology, University of ColoradoBoulderUnited States
| | - Evan Lester
- Department of Biochemistry, University of ColoradoBoulderUnited States
- Medical Scientist Training Program, University of Colorado Anschutz Medical CampusAuroraUnited States
| | - Alicia Cutler
- Department of Molecular, Cellular and Developmental Biology, University of ColoradoBoulderUnited States
| | - Tiffany Elston
- Department of Molecular, Cellular and Developmental Biology, University of ColoradoBoulderUnited States
| | - Nicole Dalla Betta
- Department of Molecular, Cellular and Developmental Biology, University of ColoradoBoulderUnited States
| | - Kevin R Parker
- Center for Personal and Dynamic Regulomes, Stanford UniversityPalo AltoUnited States
| | - Kathryn E Yost
- Center for Personal and Dynamic Regulomes, Stanford UniversityPalo AltoUnited States
| | - Hannes Vogel
- Department of Pathology, Stanford UniversityStanfordUnited States
| | - Thomas A Rando
- Department of Neurology and Neurological Sciences, Stanford University School of MedicineStanfordUnited States
- Paul F. Glenn Center for the Biology of Aging, Stanford University School of MedicineStanfordUnited States
- Center for Tissue Regeneration, Repair, and Restoration, Veterans Affairs Palo Alto Health Care SystemPalo AltoUnited States
| | - Howard Y Chang
- Center for Personal and Dynamic Regulomes, Stanford UniversityPalo AltoUnited States
- Howard Hughes Medical Institute, Stanford UniversityStanfordUnited States
| | - Aaron M Johnson
- Molecular Biology Program and Department of Biochemistry and Molecular Genetics, University of Colorado Anschutz Medical CampusAuroraUnited States
- University of Colorado School of Medicine, RNA Bioscience Initiative, University of Colorado Anschutz Medical CampusAuroraUnited States
| | - Roy Parker
- Howard Hughes Medical Institute, University of ColoradoBoulderUnited States
| | - Bradley B Olwin
- Department of Molecular, Cellular and Developmental Biology, University of ColoradoBoulderUnited States
| |
Collapse
|
36
|
Guvenek A, Shin J, De Filippis L, Zheng D, Wang W, Pang ZP, Tian B. Neuronal Cells Display Distinct Stability Controls of Alternative Polyadenylation mRNA Isoforms, Long Non-Coding RNAs, and Mitochondrial RNAs. Front Genet 2022; 13:840369. [PMID: 35664307 PMCID: PMC9159357 DOI: 10.3389/fgene.2022.840369] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 02/28/2022] [Indexed: 11/25/2022] Open
Abstract
RNA stability plays an important role in gene expression. Here, using 3' end sequencing of newly made and pre-existing poly(A)+ RNAs, we compare transcript stability in multiple human cell lines, including HEK293T, HepG2, and SH-SY5Y. We show that while mRNA stability is generally conserved across the cell lines, specific transcripts having a high GC content and possibly more stable secondary RNA structures are relatively more stable in SH-SY5Y cells compared to the other 2 cell lines. These features also differentiate stability levels of alternative polyadenylation (APA) 3'UTR isoforms in a cell type-specific manner. Using differentiation of a neural stem cell line as a model, we show that mRNA stability difference could contribute to gene expression changes in neurogenesis and confirm the neuronal identity of SH-SY5Y cells at both gene expression and APA levels. In addition, compared to transcripts using 3'-most exon cleavage/polyadenylation sites (PASs), those using intronic PASs are generally less stable, especially when the PAS-containing intron is large and has a strong 5' splice site, suggesting that intronic polyadenylation mostly plays a negative role in gene expression. Interestingly, the differential mRNA stability among APA isoforms appears to buffer PAS choice in these cell lines. Moreover, we found that several other poly(A)+ RNA species, including promoter-associated long noncoding RNAs and transcripts encoded by the mitochondrial genome, are more stable in SH-SY5Y cells than the other 2 cell lines, further highlighting distinct RNA metabolism in neuronal cells. Together, our results indicate that distinct RNA stability control in neuronal cells may contribute to the gene expression and APA programs that define their cell identity.
Collapse
Affiliation(s)
- Aysegul Guvenek
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
- Rutgers School of Graduate Studies, Newark, NJ, United States
| | - Jihae Shin
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
| | - Lidia De Filippis
- Department of Neuroscience and Cell Biology, Child Health Institute of New Jersey, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, United States
| | - Dinghai Zheng
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
| | - Wei Wang
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
| | - Zhiping P. Pang
- Department of Neuroscience and Cell Biology, Child Health Institute of New Jersey, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, United States
| | - Bin Tian
- Department of Microbiology, Biochemistry and Molecular Genetics, Rutgers New Jersey Medical School, Newark, NJ, United States
- Program in Gene Expression and Regulation, Center for Systems and Computational Biology, The Wistar Institute, Philadelphia, PA, United States
| |
Collapse
|
37
|
Kwon B, Fansler MM, Patel ND, Lee J, Ma W, Mayr C. Enhancers regulate 3' end processing activity to control expression of alternative 3'UTR isoforms. Nat Commun 2022; 13:2709. [PMID: 35581194 PMCID: PMC9114392 DOI: 10.1038/s41467-022-30525-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 05/02/2022] [Indexed: 12/12/2022] Open
Abstract
Multi-UTR genes are widely transcribed and express their alternative 3'UTR isoforms in a cell type-specific manner. As transcriptional enhancers regulate mRNA expression, we investigated if they also regulate 3'UTR isoform expression. Endogenous enhancer deletion of the multi-UTR gene PTEN did not impair transcript production but prevented 3'UTR isoform switching which was recapitulated by silencing of an enhancer-bound transcription factor. In reporter assays, enhancers increase transcript production when paired with single-UTR gene promoters. However, when combined with multi-UTR gene promoters, they change 3'UTR isoform expression by increasing 3' end processing activity of polyadenylation sites. Processing activity of polyadenylation sites is affected by transcription factors, including NF-κB and MYC, transcription elongation factors, chromatin remodelers, and histone acetyltransferases. As endogenous cell type-specific enhancers are associated with genes that increase their short 3'UTRs in a cell type-specific manner, our data suggest that transcriptional enhancers integrate cellular signals to regulate cell type-and condition-specific 3'UTR isoform expression.
Collapse
Affiliation(s)
- Buki Kwon
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Mervin M Fansler
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Graduate College, New York, NY, 10021, USA
| | - Neil D Patel
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Jihye Lee
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Weirui Ma
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Christine Mayr
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA.
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Graduate College, New York, NY, 10021, USA.
| |
Collapse
|
38
|
Yu Z, Hong L, Li QQ. Signatures of mRNA Alternative Polyadenylation in Arabidopsis Leaf Development. Front Genet 2022; 13:863253. [PMID: 35559042 PMCID: PMC9086830 DOI: 10.3389/fgene.2022.863253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 03/15/2022] [Indexed: 11/26/2022] Open
Abstract
Alternative polyadenylation (APA) of pre-mRNA is an important co-transcriptional mechanism that modulates gene expression, leading to transcriptomic and functional diversities. The role of APA in Arabidopsis leaf development, however, remains elusive. We applied a poly(A)-tag sequencing (PAT-seq) technique to characterize APA-mediated regulation events in cotyledon and in five stages of true leaf development. Over 60% APA was identified in genes expressed in leaves, consistent with the results in previous publications. However, a reduced APA level was detected in younger leaves, reaching 44% in the 18th true leaf. Importantly, we also found that >70% of the poly(A) site usages were altered in the second true leaf relative to the cotyledon. Compared with the cotyledon, more genes in the second true leaf tended to use the distal site of 3′UTR, but this was not found in pairwise comparison among other true leaves. In addition, a significant APA gene was found to be decreased in a pairwise comparison among true leaves, including differentially expressed genes. The APA genes identified herein were associated with specific biological processes, including metabolic and cellular processes and response to stimuli and hormones. These results provide a new insight into the regulation of Arabidopsis leaf development through APA.
Collapse
Affiliation(s)
- Zhibo Yu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Liwei Hong
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Qingshun Q Li
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, China.,Graduate College of Biomedical Sciences, Western University of Health Sciences, Pomona, CA, United States
| |
Collapse
|
39
|
Leveraging omic features with F3UTER enables identification of unannotated 3'UTRs for synaptic genes. Nat Commun 2022; 13:2270. [PMID: 35477703 PMCID: PMC9046390 DOI: 10.1038/s41467-022-30017-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 03/18/2022] [Indexed: 11/08/2022] Open
Abstract
There is growing evidence for the importance of 3' untranslated region (3'UTR) dependent regulatory processes. However, our current human 3'UTR catalogue is incomplete. Here, we develop a machine learning-based framework, leveraging both genomic and tissue-specific transcriptomic features to predict previously unannotated 3'UTRs. We identify unannotated 3'UTRs associated with 1,563 genes across 39 human tissues, with the greatest abundance found in the brain. These unannotated 3'UTRs are significantly enriched for RNA binding protein (RBP) motifs and exhibit high human lineage-specificity. We find that brain-specific unannotated 3'UTRs are enriched for the binding motifs of important neuronal RBPs such as TARDBP and RBFOX1, and their associated genes are involved in synaptic function. Our data is shared through an online resource F3UTER ( https://astx.shinyapps.io/F3UTER/ ). Overall, our data improves 3'UTR annotation and provides additional insights into the mRNA-RBP interactome in the human brain, with implications for our understanding of neurological and neurodevelopmental diseases.
Collapse
|
40
|
Jobbins AM, Haberman N, Artigas N, Amourda C, Paterson HAB, Yu S, Blackford SJI, Montoya A, Dore M, Wang YF, Sardini A, Cebola I, Zuber J, Rashid ST, Lenhard B, Vernia S. Dysregulated RNA polyadenylation contributes to metabolic impairment in non-alcoholic fatty liver disease. Nucleic Acids Res 2022; 50:3379-3393. [PMID: 35293570 PMCID: PMC8989518 DOI: 10.1093/nar/gkac165] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 02/16/2022] [Accepted: 03/09/2022] [Indexed: 11/19/2022] Open
Abstract
Pre-mRNA processing is an essential mechanism for the generation of mature mRNA and the regulation of gene expression in eukaryotic cells. While defects in pre-mRNA processing have been implicated in a number of diseases their involvement in metabolic pathologies is still unclear. Here, we show that both alternative splicing and alternative polyadenylation, two major steps in pre-mRNA processing, are significantly altered in non-alcoholic fatty liver disease (NAFLD). Moreover, we find that Serine and Arginine Rich Splicing Factor 10 (SRSF10) binding is enriched adjacent to consensus polyadenylation motifs and its expression is significantly decreased in NAFLD, suggesting a role mediating pre-mRNA dysregulation in this condition. Consistently, inactivation of SRSF10 in mouse and human hepatocytes in vitro, and in mouse liver in vivo, was found to dysregulate polyadenylation of key metabolic genes such as peroxisome proliferator-activated receptor alpha (PPARA) and exacerbate diet-induced metabolic dysfunction. Collectively our work implicates dysregulated pre-mRNA polyadenylation in obesity-induced liver disease and uncovers a novel role for SRSF10 in this process.
Collapse
Affiliation(s)
- Andrew M Jobbins
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Nejc Haberman
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Natalia Artigas
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Christopher Amourda
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Helen A B Paterson
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Sijia Yu
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Samuel J I Blackford
- Department of Metabolism, Digestion & Reproduction, Imperial College London, London W12 0NN, UK
| | - Alex Montoya
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Marian Dore
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Yi-Fang Wang
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Alessandro Sardini
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Inês Cebola
- Section of Genetics and Genomics, Department of Metabolism, Digestion & Reproduction, Imperial College London, London W12 0NN, UK
| | - Johannes Zuber
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), 1030 Vienna, Austria
| | - Sheikh Tamir Rashid
- Department of Metabolism, Digestion & Reproduction, Imperial College London, London W12 0NN, UK
| | - Boris Lenhard
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | - Santiago Vernia
- MRC London Institute of Medical Sciences, Du Cane Road, London W12 0NN, UK.,Institute of Clinical Sciences, Imperial College London, Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| |
Collapse
|
41
|
Xu SM, Curry-Hyde A, Sytnyk V, Janitz M. RNA polyadenylation patterns in the human transcriptome. Gene 2022; 816:146133. [PMID: 34998928 DOI: 10.1016/j.gene.2021.146133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 12/03/2021] [Accepted: 12/20/2021] [Indexed: 11/30/2022]
Abstract
The eukaryotic transcriptome undergoes various post-transcriptional modifications which assists gene expression. Polyadenylation is a molecular process occurring at the 3'-end of the RNA molecule which involves the poly(A) polymerase attaching adenine monophosphate molecules in a chain-like fashion to assemble a poly(A) tail. Multiple RNA isoforms are produced with differing 3'-UTR and exonic compositions through alternative polyadenylation (APA) which enhances the diversification of alternatively spliced mRNA transcripts. To study polyadenylation patterns, novel methods have been developed using short-read and long-read sequencing technologies to analyse the 3'-ends of the transcript. Recent studies have identified unique polyadenylation patterns in different cellular functions, including oncogenic activity, which could prove valuable in the understanding of medical genetics, particularly in the discovery of biomarkers in diseased states. We present a review of current literature reporting on polyadenylation and the biological relevance in the mammalian transcriptome, with a focus on the human transcriptome. Additionally, we have explored the various methods available to detect polyadenylation patterns using second and third generation sequencing technologies.
Collapse
Affiliation(s)
- Si-Mei Xu
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Australia
| | - Ashton Curry-Hyde
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Australia
| | - Vladimir Sytnyk
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Australia
| | - Michael Janitz
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Australia; Paul-Flechsig-Institute for Brain Research, University of Leipzig, Leipzig, Germany.
| |
Collapse
|
42
|
LC3B is an RNA-binding protein to trigger rapid mRNA degradation during autophagy. Nat Commun 2022; 13:1436. [PMID: 35302060 PMCID: PMC8931120 DOI: 10.1038/s41467-022-29139-1] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 02/23/2022] [Indexed: 12/13/2022] Open
Abstract
LC3/ATG8 has long been appreciated to play a central role in autophagy, by which a variety of cytoplasmic materials are delivered to lysosomes and eventually degraded. However, information on the molecular functions of LC3 in RNA biology is very limited. Here, we show that LC3B is an RNA-binding protein that directly binds to mRNAs with a preference for a consensus AAUAAA motif corresponding to a polyadenylation sequence. Autophagic activation promotes an association between LC3B and target mRNAs and triggers rapid degradation of target mRNAs in a CCR4-NOT–dependent manner before autolysosome formation. Furthermore, our transcriptome-wide analysis reveals that PRMT1 mRNA, which encodes a negative regulator of autophagy, is one of the major substrates. Rapid degradation of PRMT1 mRNA by LC3B facilitates autophagy. Collectively, we demonstrate that LC3B acts as an RNA-binding protein and an mRNA decay factor necessary for efficient autophagy. LC3/ATG8 plays an essential role in autophagy. Here the authors show that LC3B exhibits RNA-binding ability and induces rapid degradation of target mRNAs via autophagic activation, highlighting the interplay between autophagy and RNA biology.
Collapse
|
43
|
Implications of Poly(A) Tail Processing in Repeat Expansion Diseases. Cells 2022; 11:cells11040677. [PMID: 35203324 PMCID: PMC8870147 DOI: 10.3390/cells11040677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 02/11/2022] [Accepted: 02/13/2022] [Indexed: 11/21/2022] Open
Abstract
Repeat expansion diseases are a group of more than 40 disorders that affect mainly the nervous and/or muscular system and include myotonic dystrophies, Huntington’s disease, and fragile X syndrome. The mutation-driven expanded repeat tract occurs in specific genes and is composed of tri- to dodeca-nucleotide-long units. Mutant mRNA is a pathogenic factor or important contributor to the disease and has great potential as a therapeutic target. Although repeat expansion diseases are quite well known, there are limited studies concerning polyadenylation events for implicated transcripts that could have profound effects on transcript stability, localization, and translation efficiency. In this review, we briefly present polyadenylation and alternative polyadenylation (APA) mechanisms and discuss their role in the pathogenesis of selected diseases. We also discuss several methods for poly(A) tail measurement (both transcript-specific and transcriptome-wide analyses) and APA site identification—the further development and use of which may contribute to a better understanding of the correlation between APA events and repeat expansion diseases. Finally, we point out some future perspectives on the research into repeat expansion diseases, as well as APA studies.
Collapse
|
44
|
Inferring protein expression changes from mRNA in Alzheimer's dementia using deep neural networks. Nat Commun 2022; 13:655. [PMID: 35115553 PMCID: PMC8814036 DOI: 10.1038/s41467-022-28280-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Accepted: 01/19/2022] [Indexed: 12/30/2022] Open
Abstract
Identifying the molecular systems and proteins that modify the progression of Alzheimer's disease and related dementias (ADRD) is central to drug target selection. However, discordance between mRNA and protein abundance, and the scarcity of proteomic data, has limited our ability to advance candidate targets that are mainly based on gene expression. Therefore, by using a deep neural network that predicts protein abundance from mRNA expression, here we attempt to track the early protein drivers of ADRD. Specifically, by applying the clei2block deep learning model to 1192 brain RNA-seq samples, we identify protein modules and disease-associated expression changes that were not directly observed at the mRNA level. Moreover, pseudo-temporal trajectory inference based on the predicted proteome became more closely correlated with cognitive decline and hippocampal atrophy compared to RNA-based trajectories. This suggests that the predicted changes in protein expression could provide a better molecular representation of ADRD progression. Furthermore, overlaying clinical traits on protein pseudotime trajectory identifies protein modules altered before cognitive impairment. These results demonstrate how our method can be used to identify potential early protein drivers and possible drug targets for treating and/or preventing ADRD.
Collapse
|
45
|
Martinez NM, Su A, Burns MC, Nussbacher JK, Schaening C, Sathe S, Yeo GW, Gilbert WV. Pseudouridine synthases modify human pre-mRNA co-transcriptionally and affect pre-mRNA processing. Mol Cell 2022; 82:645-659.e9. [PMID: 35051350 PMCID: PMC8859966 DOI: 10.1016/j.molcel.2021.12.023] [Citation(s) in RCA: 74] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 11/04/2021] [Accepted: 12/17/2021] [Indexed: 02/05/2023]
Abstract
Pseudouridine is a modified nucleotide that is prevalent in human mRNAs and is dynamically regulated. Here, we investigate when in their life cycle mRNAs become pseudouridylated to illuminate the potential regulatory functions of endogenous mRNA pseudouridylation. Using single-nucleotide resolution pseudouridine profiling on chromatin-associated RNA from human cells, we identified pseudouridines in nascent pre-mRNA at locations associated with alternatively spliced regions, enriched near splice sites, and overlapping hundreds of binding sites for RNA-binding proteins. In vitro splicing assays establish a direct effect of individual endogenous pre-mRNA pseudouridines on splicing efficiency. We validate hundreds of pre-mRNA sites as direct targets of distinct pseudouridine synthases and show that PUS1, PUS7, and RPUSD4-three pre-mRNA-modifying pseudouridine synthases with tissue-specific expression-control widespread changes in alternative pre-mRNA splicing and 3' end processing. Our results establish a vast potential for cotranscriptional pre-mRNA pseudouridylation to regulate human gene expression via alternative pre-mRNA processing.
Collapse
Affiliation(s)
- Nicole M Martinez
- Yale School of Medicine, Department of Molecular Biophysics & Biochemistry, New Haven, CT 06520, USA
| | - Amanda Su
- Yale School of Medicine, Department of Molecular Biophysics & Biochemistry, New Haven, CT 06520, USA
| | - Margaret C Burns
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92037, USA; Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA; Institute for Genomic Medicine, University of California, San Diego, La Jolla, CA 92037, USA
| | - Julia K Nussbacher
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92037, USA; Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA; Institute for Genomic Medicine, University of California, San Diego, La Jolla, CA 92037, USA
| | - Cassandra Schaening
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Shashank Sathe
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92037, USA; Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA; Institute for Genomic Medicine, University of California, San Diego, La Jolla, CA 92037, USA
| | - Gene W Yeo
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92037, USA; Stem Cell Program, University of California, San Diego, La Jolla, CA 92037, USA; Institute for Genomic Medicine, University of California, San Diego, La Jolla, CA 92037, USA.
| | - Wendy V Gilbert
- Yale School of Medicine, Department of Molecular Biophysics & Biochemistry, New Haven, CT 06520, USA.
| |
Collapse
|
46
|
Nabeel-Shah S, Lee H, Ahmed N, Burke GL, Farhangmehr S, Ashraf K, Pu S, Braunschweig U, Zhong G, Wei H, Tang H, Yang J, Marcon E, Blencowe BJ, Zhang Z, Greenblatt JF. SARS-CoV-2 nucleocapsid protein binds host mRNAs and attenuates stress granules to impair host stress response. iScience 2022; 25:103562. [PMID: 34901782 PMCID: PMC8642831 DOI: 10.1016/j.isci.2021.103562] [Citation(s) in RCA: 55] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Revised: 10/25/2021] [Accepted: 12/01/2021] [Indexed: 12/22/2022] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) nucleocapsid (N) protein is essential for viral replication, making it a promising target for antiviral drug and vaccine development. SARS-CoV-2 infected patients exhibit an uncoordinated immune response; however, the underlying mechanistic details of this imbalance remain obscure. Here, starting from a functional proteomics workflow, we cataloged the protein-protein interactions of SARS-CoV-2 proteins, including an evolutionarily conserved specific interaction of N with the stress granule resident proteins G3BP1 and G3BP2. N localizes to stress granules and sequesters G3BPs away from their typical interaction partners, thus attenuating stress granule formation. We found that N binds directly to host mRNAs in cells, with a preference for 3' UTRs, and modulates target mRNA stability. We show that the N protein rewires the G3BP1 mRNA-binding profile and suppresses the physiological stress response of host cells, which may explain the imbalanced immune response observed in SARS-CoV-2 infected patients.
Collapse
Affiliation(s)
- Syed Nabeel-Shah
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Hyunmin Lee
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Computer Sciences, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Nujhat Ahmed
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Giovanni L Burke
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Shaghayegh Farhangmehr
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Kanwal Ashraf
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Shuye Pu
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | | | - Guoqing Zhong
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Hong Wei
- School of Mathematical Sciences, Nankai University, Tianjin 300071, China
| | - Hua Tang
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Jianyi Yang
- School of Mathematical Sciences, Nankai University, Tianjin 300071, China
| | - Edyta Marcon
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Benjamin J Blencowe
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Zhaolei Zhang
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
- Department of Computer Sciences, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Jack F Greenblatt
- Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| |
Collapse
|
47
|
Fiszbein A, McGurk M, Calvo-Roitberg E, Kim G, Burge CB, Pai AA. Widespread occurrence of hybrid internal-terminal exons in human transcriptomes. SCIENCE ADVANCES 2022; 8:eabk1752. [PMID: 35044812 PMCID: PMC8769537 DOI: 10.1126/sciadv.abk1752] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Accepted: 11/23/2021] [Indexed: 06/12/2023]
Abstract
Messenger RNA isoform differences are predominantly driven by alternative first, internal, and last exons. Despite the importance of classifying exons to understand isoform structure, few tools examine isoform-specific exon usage. We recently observed that alternative transcription start sites often arise near internal exons, often creating “hybrid” first/internal exons. To systematically detect hybrid exons, we built the hybrid-internal-terminal (HIT) pipeline to classify exons depending on their isoform-specific usage. On the basis of splice junction reads in RNA sequencing data and probabilistic modeling, the HIT index identified thousands of previously misclassified hybrid first-internal and internal-last exons. Hybrid exons are enriched in long genes and genes involved in RNA splicing and have longer flanking introns and strong splice sites. Their usage varies considerably across human tissues. By developing the first method to classify exons according to isoform contexts, our findings document the occurrence of hybrid exons, a common quirk of the human transcriptome.
Collapse
Affiliation(s)
- Ana Fiszbein
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Biology, Boston University, Boston, MA, USA
| | - Michael McGurk
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | | | - GyeungYun Kim
- Department of Biology, Boston University, Boston, MA, USA
| | - Christopher B. Burge
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Athma A. Pai
- RNA Therapeutics Institute, University of Massachusetts Medical School, Worcester, MA, USA
| |
Collapse
|
48
|
Bae B, Miura P. CRISPR-Mediated Knockout of Long 3' UTR mRNA Isoforms in mESC-Derived Neurons. Front Genet 2022; 12:789434. [PMID: 34976020 PMCID: PMC8718760 DOI: 10.3389/fgene.2021.789434] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 11/18/2021] [Indexed: 12/15/2022] Open
Abstract
Alternative cleavage and polyadenylation (APA) is pervasive, occurring for more than 70% of human and mouse genes. Distal poly(A) site selection to generate longer 3′ UTR mRNA isoforms is prevalent in the nervous system, affecting thousands of genes. Here, we establish mouse embryonic stem cell (mESC)-derived neurons (mES-neurons) as a suitable system to study long 3′ UTR isoforms. RNA-seq analysis revealed that mES-neurons show widespread 3′ UTR lengthening that closely resembles APA patterns found in mouse cortex. mESCs are highly amenable to genetic manipulation. We present a method to eliminate long 3′ UTR isoform expression using CRISPR/Cas9 editing. This approach can lead to clones with the desired deletion within several weeks. We demonstrate this strategy on the Mprip gene as a proof-of-principle. To confirm loss of long 3′ UTR expression and the absence of cryptic poly(A) site usage stemming from the CRISPR deletion, we present a simple and cost-efficient targeted long-read RNA-sequencing strategy using the Oxford Nanopore Technologies platform. Using this method, we confirmed specific loss of the Mprip long 3′ UTR isoform. CRISPR gene editing of mESCs thus serves as a highly relevant platform for studying the molecular and cellular functions of long 3′ UTR mRNA isoforms.
Collapse
Affiliation(s)
- Bongmin Bae
- Department of Biology, University of Nevada, Reno, Reno, NV, United States
| | - Pedro Miura
- Department of Biology, University of Nevada, Reno, Reno, NV, United States
| |
Collapse
|
49
|
Zhou R, Xiao X, He P, Zhao Y, Xu M, Zheng X, Yang R, Chen S, Zhou L, Zhang D, Yang Q, Song J, Tang C, Zhang Y, Lin JW, Cheng L, Chen L. OUP accepted manuscript. Nucleic Acids Res 2022; 50:e66. [PMID: 35288753 PMCID: PMC9226526 DOI: 10.1093/nar/gkac167] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Revised: 01/26/2022] [Accepted: 03/09/2022] [Indexed: 11/14/2022] Open
Abstract
Alternative polyadenylation increases transcript diversities at the 3’ end, regulating biological processes including cell differentiation, embryonic development and cancer progression. Here, we present a Bayesian method SCAPE, which enables de novo identification and quantification of polyadenylation (pA) sites at single-cell level by utilizing insert size information. We demonstrated its accuracy and robustness and identified 31 558 sites from 36 mouse organs, 43.8% (13 807) of which were novel. We illustrated that APA isoforms were associated with miRNAs binding and regulated in tissue-, cell type-and tumor-specific manners where no difference was found at gene expression level, providing an extra layer of information for cell clustering. Furthermore, we found genome-wide dynamic changes of APA usage during erythropoiesis and induced pluripotent stem cell (iPSC) differentiation, suggesting APA contributes to the functional flexibility and diversity of single cells. We expect SCAPE to aid the analyses of cellular dynamics and diversities in health and disease.
Collapse
Affiliation(s)
- Ran Zhou
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Xia Xiao
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Ping He
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Yuancun Zhao
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Mengying Xu
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Xiuran Zheng
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Ruirui Yang
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Shasha Chen
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Lifang Zhou
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Dan Zhang
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Qingxin Yang
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Junwei Song
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Chao Tang
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Yiming Zhang
- Key Laboratory of Birth Defects and Related Diseases of Women and Children of MOE, Department of Laboratory Medicine, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, Sichuan 610041, China
| | - Jing-wen Lin
- To whom correspondence should be addressed. Tel: +86 028 8546 8389;
| | - Lu Cheng
- Correspondence may also be addressed to Lu Cheng.
| | - Lu Chen
- Correspondence may also be addressed to Lu Chen.
| |
Collapse
|
50
|
Ju CJT, Jiang JY, Li R, Li Z, Wang W. TahcoRoll: fast genomic signature profiling via thinned automaton and rolling hash. MEDICAL REVIEW (2021) 2021; 1:114-125. [PMID: 35881666 PMCID: PMC9027990 DOI: 10.1515/mr-2021-0016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 11/11/2021] [Indexed: 12/04/2022]
Abstract
Objectives Genomic signatures like k-mers have become one of the most prominent approaches to describe genomic data. As a result, myriad real-world applications, such as the construction of de Bruijn graphs in genome assembly, have been benefited by recognizing genomic signatures. In other words, an efficient approach of genomic signature profiling is an essential need for tackling high-throughput sequencing reads. However, most of the existing approaches only recognize fixed-size k-mers while many research studies have shown the importance of considering variable-length k-mers. Methods In this paper, we present a novel genomic signature profiling approach, TahcoRoll, by extending the Aho-Corasick algorithm (AC) for the task of profiling variable-length k-mers. We first group nucleotides into two clusters and represent each cluster with a bit. The rolling hash technique is further utilized to encode signatures and read patterns for efficient matching. Results In extensive experiments, TahcoRoll significantly outperforms the most state-of-the-art k-mer counters and has the capability of processing reads across different sequencing platforms on a budget desktop computer. Conclusions The single-thread version of TahcoRoll is as efficient as the eight-thread version of the state-of-the-art, JellyFish, while the eight-thread TahcoRoll outperforms the eight-thread JellyFish by at least four times.
Collapse
Affiliation(s)
- Chelsea J.-T. Ju
- Department of Computer Science, University of California, Los Angeles, USA
| | - Jyun-Yu Jiang
- Department of Computer Science, University of California, Los Angeles, USA
| | - Ruirui Li
- Department of Computer Science, University of California, Los Angeles, USA
| | - Zeyu Li
- Department of Computer Science, University of California, Los Angeles, USA
| | - Wei Wang
- Department of Computer Science, University of California, Los Angeles, USA
| |
Collapse
|