1
|
Qiao P, Zhang C, Shi Y, Du H. The role of alternative polyadenylation in breast cancer. Front Genet 2024; 15:1377275. [PMID: 38939531 PMCID: PMC11208690 DOI: 10.3389/fgene.2024.1377275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/27/2024] [Accepted: 05/24/2024] [Indexed: 06/29/2024] Open
Abstract
Breast cancer (BC) is a highly prevalent malignant tumor worldwide, yet its pathogenesis remains unclear and therapeutic outcomes are poor. Alternative polyadenylation (APA) is a post-transcriptional regulatory mechanism widely found in eukaryotes. Precursor mRNA (pre-mRNA) undergoes APA to generate multiple mRNA isoforms with different coding regions or 3'UTRs, thereby greatly increasing the diversity and complexity of the eukaryotic transcriptome and proteome. Studies have shown that APA plays a crucial role in the progression of various diseases, including cancer. Therefore, clarifying the biological mechanisms of APA and its regulators in breast cancer will help to comprehensively understand the pathogenesis of breast cancer and provide new ideas for its prevention and treatment.
Affiliation(s)
- Ping Qiao
  - Department of Laboratory, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, China
- Caihong Zhang
  - Department of Laboratory, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, China
- Yingxu Shi
  - Department of Laboratory, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, China
- Hua Du
  - Department of Pathology, Affiliated Hospital of Inner Mongolia Medical University, Hohhot, China
  - College of Basic Medicine, Inner Mongolia Medical University, Hohhot, China
|
2
|
Jia J, Fan H, Wan X, Fang Y, Li Z, Tang Y, Zhang Y, Huang J, Fang D. FUS reads histone H3K36me3 to regulate alternative polyadenylation. Nucleic Acids Res 2024; 52:5549-5571. [PMID: 38499486 PMCID: PMC11162772 DOI: 10.1093/nar/gkae184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/17/2023] [Revised: 02/18/2024] [Accepted: 03/04/2024] [Indexed: 03/20/2024] Open
Abstract
Complex organisms generate differential gene expression through the same set of DNA sequences in distinct cells. The communication between chromatin and RNA regulates cellular behavior in tissues. However, little is known about how chromatin, especially histone modifications, regulates RNA polyadenylation. In this study, we found that FUS was recruited to chromatin by H3K36me3 at gene bodies. The H3K36me3 recognition of FUS was mediated by the proline residues in the ZNF domain. After these proline residues were mutated or H3K36me3 was abolished, FUS dissociated from chromatin and bound more to RNA, resulting in an increase in polyadenylation sites far from stop codons genome-wide. A proline mutation corresponding to a mutation in amyotrophic lateral sclerosis contributed to the hyperactivation of mitochondria and hyperdifferentiation in mouse embryonic stem cells. These findings reveal that FUS is an H3K36me3 reader protein that links chromatin-mediated alternative polyadenylation to human disease.
Affiliation(s)
- Junqi Jia
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Haonan Fan
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Xinyi Wan
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Yuan Fang
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Zhuoning Li
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Yin Tang
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Yanjun Zhang
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Jun Huang
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Dong Fang
  - Zhejiang Provincial Key Laboratory for Cancer Molecular Cell Biology, Life Sciences Institute, Zhejiang University, Hangzhou, Zhejiang 310058, China
  - Department of Medical Oncology, Key Laboratory of Cancer Prevention and Intervention, Ministry of Education, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, Zhejiang, China
|
3
|
Chapus F, Giraud G, Huchon P, Rodà M, Grand X, Charre C, Goldsmith C, Roca Suarez AA, Martinez MG, Fresquet J, Diederichs A, Locatelli M, Polvèche H, Scholtès C, Chemin I, Hernandez Vargas H, Rivoire M, Bourgeois CF, Zoulim F, Testoni B. Helicases DDX5 and DDX17 promote heterogeneity in HBV transcription termination in infected human hepatocytes. J Hepatol 2024:S0168-8278(24)00351-9. [PMID: 38782119 DOI: 10.1016/j.jhep.2024.05.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/16/2023] [Revised: 03/28/2024] [Accepted: 05/02/2024] [Indexed: 05/25/2024]
Abstract
BACKGROUND & AIMS Transcription termination fine-tunes gene expression and contributes to the specification of RNA function in eukaryotic cells. Transcription termination of HBV is subject to the recognition of the canonical polyadenylation signal (cPAS) common to all viral transcripts. However, the regulation of this cPAS and its impact on viral gene expression and replication is currently unknown. METHODS To unravel the regulation of HBV transcript termination, we implemented a 3' RACE (rapid amplification of cDNA ends)-PCR assay coupled to single molecule sequencing both in in vitro-infected hepatocytes and in chronically infected patients. RESULTS The detection of a previously unidentified transcriptional readthrough indicated that the cPAS was not systematically recognized during HBV replication in vitro and in vivo. Gene expression downregulation experiments demonstrated a role for the RNA helicases DDX5 and DDX17 in promoting viral transcriptional readthrough, which was, in turn, associated with HBV RNA destabilization and decreased HBx protein expression. RNA and chromatin immunoprecipitation, together with mutation of the cPAS sequence, suggested a direct role of DDX5 and DDX17 in functionally linking cPAS recognition to transcriptional readthrough, HBV RNA stability and replication. CONCLUSIONS Our findings identify DDX5 and DDX17 as crucial determinants of HBV transcriptional fidelity and as host restriction factors for HBV replication. IMPACT AND IMPLICATIONS HBV covalently closed circular (ccc)DNA degradation or functional inactivation remains the holy grail for the achievement of HBV cure. Transcriptional fidelity is a cornerstone in the regulation of gene expression. Here, we demonstrate that two helicases, DDX5 and DDX17, inhibit recognition of the HBV polyadenylation signal and thereby transcriptional termination, thus decreasing HBV RNA stability and acting as restriction factors for efficient cccDNA transcription and viral replication. 
The observation that DDX5 and DDX17 are downregulated in patients chronically infected with HBV suggests a role for these helicases in HBV persistence in vivo. These results open new perspectives for researchers aiming at identifying new targets to neutralise cccDNA transcription.
Affiliation(s)
- Fleur Chapus
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France
- Guillaume Giraud
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Pélagie Huchon
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Mélanie Rodà
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Xavier Grand
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Caroline Charre
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France; Department of Virology, Croix Rousse Hospital, Hospices Civils de Lyon, Lyon, France
- Armando Andres Roca Suarez
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Maria-Guadalupe Martinez
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France
- Judith Fresquet
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France
- Audrey Diederichs
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Maëlle Locatelli
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France
- Hélène Polvèche
  - CECS/AFM, I-Stem, Corbeil-Essonnes, 91100, France; University Claude Bernard of Lyon, Ecole Normale Supérieure de Lyon, CNRS UMR 5239, INSERM U1293, Laboratory of Biology and Modelling of the Cell, 69007, Lyon, France
- Caroline Scholtès
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France; Department of Virology, Croix Rousse Hospital, Hospices Civils de Lyon, Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Isabelle Chemin
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Michel Rivoire
  - INSERM U1032, Centre Léon Bérard (CLB), 69008 Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Cyril F Bourgeois
  - University Claude Bernard of Lyon, Ecole Normale Supérieure de Lyon, CNRS UMR 5239, INSERM U1293, Laboratory of Biology and Modelling of the Cell, 69007, Lyon, France
- Fabien Zoulim
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; University of Lyon, UMR_S1052, CRCL, 69008 Lyon, France; Department of Hepatology, Hospices Civils de Lyon, France; The Lyon Hepatology Institute EVEREST, France
- Barbara Testoni
  - INSERM U1052, CNRS UMR-5286, Cancer Research Center of Lyon (CRCL), Lyon, France; The Lyon Hepatology Institute EVEREST, France
|
4
|
Pardo-Palacios FJ, Arzalluz-Luque A, Kondratova L, Salguero P, Mestre-Tomás J, Amorín R, Estevan-Morió E, Liu T, Nanni A, McIntyre L, Tseng E, Conesa A. SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms. Nat Methods 2024; 21:793-797. [PMID: 38509328 PMCID: PMC11093726 DOI: 10.1038/s41592-024-02229-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/01/2023] [Accepted: 03/01/2024] [Indexed: 03/22/2024]
Abstract
SQANTI3 is a tool designed for the quality control, curation and annotation of long-read transcript models obtained with third-generation sequencing technologies. Leveraging its annotation framework, SQANTI3 calculates quality descriptors of transcript models, junctions and transcript ends. With this information, potential artifacts can be identified and replaced with reliable sequences. Furthermore, the integrated functional annotation feature enables subsequent functional iso-transcriptomics analyses.
Affiliation(s)
- Francisco J Pardo-Palacios
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Valencia, Spain
  - Department of Applied Statistics and Operational Research, and Quality, Universitat Politècnica de València, Valencia, Valencia, Spain
- Angeles Arzalluz-Luque
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Valencia, Spain
  - Department of Applied Statistics and Operational Research, and Quality, Universitat Politècnica de València, Valencia, Valencia, Spain
- Liudmyla Kondratova
  - Horticultural Sciences Department, University of Florida, Gainesville, FL, USA
  - Genetics Institute, University of Florida, Gainesville, FL, USA
- Pedro Salguero
  - Department of Applied Statistics and Operational Research, and Quality, Universitat Politècnica de València, Valencia, Valencia, Spain
- Jorge Mestre-Tomás
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Valencia, Spain
- Rocío Amorín
  - Genetics Institute, University of Florida, Gainesville, FL, USA
  - Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, USA
- Eva Estevan-Morió
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Valencia, Spain
- Tianyuan Liu
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Valencia, Spain
- Adalena Nanni
  - Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA
- Lauren McIntyre
  - Genetics Institute, University of Florida, Gainesville, FL, USA
  - Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, USA
- Ana Conesa
  - Institute for Integrative Systems Biology, Spanish National Research Council, Paterna, Valencia, Spain
|
5
|
Cackett G, Sýkora M, Portugal R, Dulson C, Dixon L, Werner F. Transcription termination and readthrough in African swine fever virus. Front Immunol 2024; 15:1350267. [PMID: 38545109 PMCID: PMC10965686 DOI: 10.3389/fimmu.2024.1350267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/05/2023] [Accepted: 01/30/2024] [Indexed: 04/13/2024] Open
Abstract
Introduction African swine fever virus (ASFV) is a nucleocytoplasmic large DNA virus (NCLDV) that encodes its own host-like RNA polymerase (RNAP) and factors required to produce mature mRNA. The formation of accurate mRNA 3' ends by ASFV RNAP depends on transcription termination, likely enabled by a combination of sequence motifs and transcription factors, although these are poorly understood. The termination of any RNAP is rarely 100% efficient, and the transcriptional "readthrough" at terminators can generate long mRNAs which may interfere with the expression of downstream genes. ASFV transcriptome analyses reveal a landscape of heterogeneous mRNA 3' termini, likely a combination of bona fide termination sites and the result of mRNA degradation and processing. While short-read sequencing (SRS) like 3' RNA-seq indicates an accumulation of mRNA 3' ends at specific sites, it cannot inform about which promoters and transcription start sites (TSSs) directed their synthesis, i.e., information about the complete and unprocessed mRNAs at nucleotide resolution. Methods Here, we report a rigorous analysis of full-length ASFV transcripts using long-read sequencing (LRS). We systematically compared transcription termination sites predicted from SRS 3' RNA-seq with 3' ends mapped by LRS during early and late infection. Results Using in-vitro transcription assays, we show that recombinant ASFV RNAP terminates transcription at polyT stretches in the non-template strand, similar to the archaeal RNAP or eukaryotic RNAPIII, unaided by secondary RNA structures or predicted viral termination factors. Our results cement this T-rich motif (U-rich in the RNA) as a universal transcription termination signal in ASFV. Many genes share the usage of the same terminators, while genes can also use a range of terminators to generate transcript isoforms varying enormously in length. 
A key factor in the latter phenomenon is the highly abundant terminator readthrough we observed, which is more prevalent during late compared with early infection. Discussion This indicates that ASFV mRNAs under the control of late gene promoters utilize different termination mechanisms and factors to early promoters and/or that cellular factors influence the viral transcriptome landscape differently during the late stages of infection.
Affiliation(s)
- Gwenny Cackett
  - Institute for Structural and Molecular Biology, University College London, London, United Kingdom
- Michal Sýkora
  - Institute for Structural and Molecular Biology, University College London, London, United Kingdom
- Christopher Dulson
  - Institute for Structural and Molecular Biology, University College London, London, United Kingdom
- Linda Dixon
  - Pirbright Institute, Pirbright, Surrey, United Kingdom
- Finn Werner
  - Institute for Structural and Molecular Biology, University College London, London, United Kingdom
|
6
|
Yeganeh Markid T, Hosseinpour Feizi MA, Talebi M, Rezazadeh M, Khalaj-Kondori M. Gene expression investigation of four key regulators of polyadenylation and alternative adenylation in the periphery of late-onset Alzheimer's disease patients. Gene 2024; 895:148013. [PMID: 37981081 DOI: 10.1016/j.gene.2023.148013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2023] [Revised: 10/11/2023] [Accepted: 11/15/2023] [Indexed: 11/21/2023]
Abstract
BACKGROUND Alzheimer's disease (AD) is a genetic and sporadic neurodegenerative disease characterized by an archetypal cognitive impairment and a decrease in less common cognitive impairment. Notably, the discovery of goals in this paradigm is still a challenge, and understanding basic mechanisms is an important step toward improving disease management. Polyadenylation (PA) and alternative polyadenylation (APA) are two of the most critical RNA processing stages in 3'UTRs that influence various AD-related genes. METHODS In this study, we assessed the expression of the cleavage and polyadenylation specificity factor 1 and 6 (CPSF1 and CPSF6), cleavage stimulation factor 1 (CSTF1), and WD repeat domain 33 (WDR33) genes in the periphery of 50 AD patients and 50 age- and gender-matched healthy individuals by quantitative real-time PCR. RESULTS Expression analysis comparing AD patients with healthy individuals revealed a substantial increase in CSTF1 (posterior beta = 0.773, adjusted P-value = 0.042). Significant positive correlations were found between the expression levels of CSTF1 and those of CPSF1 (r = 0.365, P < 0.001), WDR33 (r = 0.506, P < 0.001), and CPSF6 (r = 0.446, P < 0.001). CONCLUSION Although further research is required to determine their potential contribution to AD, our findings offer a fresh perspective on molecular regulatory pathways linking PA and APA to AD pathogenic mechanisms.
Affiliation(s)
- Tarlan Yeganeh Markid
  - Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Iran; Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, Iran
- Mahnaz Talebi
  - Neurosciences Research Center (NSRC), Tabriz University of Medical Sciences, Tabriz, Iran
- Maryam Rezazadeh
  - Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Iran; Department of Medical Genetics, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
- Mohammad Khalaj-Kondori
  - Department of Animal Biology, Faculty of Natural Sciences, University of Tabriz, Tabriz, Iran
|
7
|
Qin X, Meng C, Li C, Zhao W, Ren S, Cao S, Zhou G. Alternative Polyadenylation of Malic Enzyme 1 Is Essential for Accelerated Adipogenesis. J Agric Food Chem 2023; 71:20815-20825. [PMID: 38088871 DOI: 10.1021/acs.jafc.3c06289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/17/2023]
Abstract
Understanding the mechanism of adipogenesis is an important basis for improving meat quality traits of livestock. Alternative polyadenylation (APA) is a vital mechanism for regulating the expression of eukaryotic genes. However, how individual APA events function in adipogenesis remains elusive. This study investigated the effect of malic enzyme 1 (ME1) APA on adipogenesis. Intracellular lipid droplets were stained using Oil red O, and 3' RACE was used to verify APA events of the ME1 gene. Interactions between ME1 3' untranslated region (3' UTR)-APA isoforms and miRNAs, as well as differential expression of isoforms, were examined using dual-luciferase reporter and molecular experiments. The mechanism of ME1 APA in adipogenesis was explored by gain- and loss-of-function assays. Two ME1 isoforms with different 3' UTR lengths were detected during adipogenesis. Moreover, the ME1 isoform with a short 3' UTR was significantly upregulated compared with the one with a long 3' UTR. Mechanistically, only the long ME1 isoform was targeted by miR-153-3p to attenuate adipogenesis, while the short one escaped the regulation of miR-153-3p to accelerate adipogenesis. Our results reveal a novel mechanism of ME1 APA in regulating adipogenesis.
Affiliation(s)
- Xuyong Qin
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
- Chaoqun Meng
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
- Chengping Li
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
- Wei Zhao
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
- Shizhong Ren
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
- Shujun Cao
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
- Guoli Zhou
  - College of Life Science, Liaocheng University, Liaocheng 252000, China
|
8
|
Shiferaw HK, Hong CS, Cooper DN, Johnston JJ, NISC, Biesecker LG. Genome-wide identification of dominant polyadenylation hexamers for use in variant classification. Hum Mol Genet 2023; 32:3211-3224. [PMID: 37606238 PMCID: PMC10656703 DOI: 10.1093/hmg/ddad136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/11/2023] [Revised: 07/17/2023] [Accepted: 08/14/2023] [Indexed: 08/23/2023] Open
Abstract
Polyadenylation is an essential process for the stabilization and export of mRNAs to the cytoplasm, and the polyadenylation signal hexamer (herein referred to as hexamer) plays a key role in this process. Yet, only 14 Mendelian disorders have been associated with hexamer variants. This is likely an under-ascertainment, as hexamers are not well defined and not routinely examined in molecular analysis. To facilitate the interrogation of putatively pathogenic hexamer variants, we set out to define functionally important hexamers genome-wide as a resource for research and clinical testing. We identified predominant polyA sites (herein referred to as pPAS) and putative predominant hexamers across protein coding genes (PAS usage >50% per gene). As a measure of the validity of these sites, the population constraint of 4532 predominant hexamers was measured. The predominant hexamers had fewer observed variants compared to non-predominant hexamers and trimer controls, and CADD scores for variants in these hexamers were significantly higher than controls. Exome data for 1477 individuals were interrogated for hexamer variants and transcriptome data were generated for 76 individuals with 65 variants in predominant hexamers. 3' RNA-seq data showed these variants resulted in alternate polyadenylation events (38%) and in elongated mRNA transcripts (12%). Our list of pPAS and predominant hexamers is available in the UCSC genome browser and on GitHub. We suggest this list of predominant hexamers can be used to interrogate exome and genome data. Variants in these predominant hexamers should be considered candidates for pathogenic variation in human disease, and to that end we suggest pathogenicity criteria for classifying hexamer variants.
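The "PAS usage >50% per gene" criterion in the abstract above can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function name `predominant_pas`, the input layout (read counts keyed by gene and site), and the toy gene and site identifiers are all assumptions made for illustration.

```python
from collections import defaultdict


def predominant_pas(site_counts, threshold=0.5):
    """Return {gene: (site_id, usage)} for genes with one site above threshold.

    site_counts maps (gene, site_id) -> read count (hypothetical layout).
    Usage is the site's share of all reads assigned to that gene.
    """
    # Sum reads over all poly(A) sites of each gene.
    totals = defaultdict(int)
    for (gene, _site), count in site_counts.items():
        totals[gene] += count

    result = {}
    for (gene, site), count in site_counts.items():
        total = totals[gene]
        if total == 0:
            continue  # no reads for this gene, usage is undefined
        usage = count / total
        # With a threshold of 0.5 or more, at most one site per gene qualifies.
        if usage > threshold:
            result[gene] = (site, usage)
    return result


# Hypothetical toy counts: GENE_A has a dominant site, GENE_B is split evenly.
counts = {
    ("GENE_A", "pas1"): 80,
    ("GENE_A", "pas2"): 20,
    ("GENE_B", "pas1"): 50,
    ("GENE_B", "pas2"): 50,
}
print(predominant_pas(counts))
```

In this toy input only GENE_A has a site above the threshold; GENE_B's two sites each carry exactly half of the reads, so neither counts as predominant under a strict ">50%" reading.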
Affiliation(s)
- Henoke K Shiferaw
  - Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, United States
- Celine S Hong
  - Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, United States
- David N Cooper
  - Institute of Medical Genetics, School of Medicine, Cardiff University, Heath Park, Cardiff CF14 4XN, United Kingdom
- Jennifer J Johnston
  - Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, United States
- NISC
  - NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, United States
- Leslie G Biesecker
  - Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, United States
|
9
|
Marjamaa A, Gibbs B, Kotrba C, Masamha CP. The role and impact of alternative polyadenylation and miRNA regulation on the expression of the multidrug resistance-associated protein 1 (MRP-1/ABCC1) in epithelial ovarian cancer. Sci Rep 2023; 13:17476. [PMID: 37838788 PMCID: PMC10576765 DOI: 10.1038/s41598-023-44548-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/13/2023] [Accepted: 10/10/2023] [Indexed: 10/16/2023] Open
Abstract
The ATP-binding cassette transporter (ABCC1) is associated with poor survival and chemotherapy drug resistance in high grade serous ovarian cancer (HGSOC). The mechanisms driving ABCC1 expression are poorly understood. Alternative polyadenylation (APA) can give rise to ABCC1 mRNAs which differ only in the length of their 3'untranslated regions (3'UTRs) in a process known as 3'UTR-APA. Like other ABC transporters, shortening of the 3'UTR of ABCC1 through 3'UTR-APA would eliminate microRNA binding sites found within the longer 3'UTRs, hence eliminating miRNA regulation and altering gene expression. We found that the HGSOC cell lines Caov-3 and Ovcar-3 express higher levels of ABCC1 protein than normal cells. APA of ABCC1 occurs in all three cell lines resulting in mRNAs with both short and long 3'UTRs. In Ovcar-3, mRNAs with shorter 3'UTRs dominate resulting in a six-fold increase in protein expression. We were able to show that miR-185-5p and miR-326 both target the ABCC1 3'UTR. Hence, 3'UTR-APA should be considered as an important regulator of ABCC1 expression in HGSOC. Both HGSOC cell lines are cisplatin resistant, and we used erastin to induce ferroptosis, an alternative form of cell death. We showed that we could induce ferroptosis and sensitize the cisplatin resistant cells to cisplatin by using erastin. Knocking down ABCC1 resulted in decreased cell viability, but did not contribute to erastin induced ferroptosis.
Affiliation(s)
- Audrey Marjamaa
  - Department of Chemistry and Biochemistry, Butler University, Indianapolis, IN, 46208, USA
- Bettine Gibbs
  - Department of Pharmaceutical Sciences, Butler University, Indianapolis, IN, 46208, USA
  - Department of Microbiology, Harvard Medical School, Boston, MA, 02115, USA
- Chloe Kotrba
  - Department of Pharmaceutical Sciences, Butler University, Indianapolis, IN, 46208, USA
|
10
|
Swale C, Hakimi MA. 3'-end mRNA processing within apicomplexan parasites, a patchwork of classic, and unexpected players. Wiley Interdiscip Rev RNA 2023; 14:e1783. [PMID: 36994829 DOI: 10.1002/wrna.1783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/25/2022] [Revised: 01/17/2023] [Accepted: 01/25/2023] [Indexed: 03/31/2023]
Abstract
The 3'-end processing of mRNA is a co-transcriptional process that leads to the formation of a poly-adenosine tail on the mRNA and directly controls termination of the RNA polymerase II juggernaut. This process involves a megadalton complex composed of cleavage and polyadenylation specificity factors (CPSFs) that are able to recognize cis-sequence elements on nascent mRNA to then carry out cleavage and polyadenylation reactions. Recent structural and biochemical studies have defined the roles played by different subunits of the complex and provided a comprehensive mechanistic understanding of this machinery in yeast or metazoans. More recently, the discovery of small molecule inhibitors of CPSF function in Apicomplexa has stimulated interest in studying the specificities of this ancient eukaryotic machinery in these organisms. Although its function is conserved in Apicomplexa, the CPSF complex integrates a novel reader of the N6-methyladenosine (m6A). This feature, inherited from the plant kingdom, bridges m6A metabolism directly to 3'-end processing and by extension, to transcription termination. In this review, we will examine convergence and divergence of CPSF within the apicomplexan parasites and explore the potential of small molecule inhibition of this machinery within these organisms. This article is categorized under: RNA Processing > 3' End Processing RNA Processing > RNA Editing and Modification.
Affiliation(s)
- Christopher Swale
  - Team Host-Pathogen Interactions and Immunity to Infection, Institute for Advanced Biosciences, INSERM U1209, CNRS UMR5309, Grenoble Alpes University, Grenoble, France
- Mohamed-Ali Hakimi
  - Team Host-Pathogen Interactions and Immunity to Infection, Institute for Advanced Biosciences, INSERM U1209, CNRS UMR5309, Grenoble Alpes University, Grenoble, France
|
11
|
Moon Y, Burri D, Zavolan M. Identification of experimentally-supported poly(A) sites in single-cell RNA-seq data with SCINPAS. NAR Genom Bioinform 2023; 5:lqad079. [PMID: 37705828 PMCID: PMC10495540 DOI: 10.1093/nargab/lqad079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2023] [Revised: 08/15/2023] [Accepted: 08/23/2023] [Indexed: 09/15/2023] Open
Abstract
Alternative polyadenylation is a main driver of transcriptome diversity in mammals, generating transcript isoforms with different 3' ends via cleavage and polyadenylation at distinct polyadenylation (poly(A)) sites. The regulation of cell type-specific poly(A) site choice is not completely resolved, and requires quantitative poly(A) site usage data across cell types. 3' end-based single-cell RNA-seq can now be broadly used to obtain such data, enabling the identification and quantification of poly(A) sites with direct experimental support. We propose SCINPAS, a computational method to identify poly(A) sites from scRNA-seq datasets. SCINPAS modifies the read deduplication step to favor the selection of distal reads and extract those with non-templated poly(A) tails. This approach improves the resolution of poly(A) site recovery relative to standard software. SCINPAS identifies poly(A) sites in genic and non-genic regions, providing complementary information relative to other tools. The workflow is modular, and the key read deduplication step is general, enabling the use of SCINPAS in other typical analyses of single cell gene expression. Taken together, we show that SCINPAS is able to identify experimentally-supported, known and novel poly(A) sites from 3' end-based single-cell RNA sequencing data.
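The idea of extracting reads with non-templated poly(A) tails can be illustrated with a minimal sketch. It is not SCINPAS's actual implementation: the function name, the thresholds, and the assumption that the read is reported on the transcript's sense strand (so that the tail sits in the soft-clipped 3' end of the alignment) are all hypothetical choices made for illustration.

```python
import re


def softclipped_polya_tail(read_seq, cigar, min_len=5, min_a_frac=0.8):
    """Return the 3' soft-clipped tail if it looks like a poly(A) tail, else None.

    Hypothetical helper: assumes the read is on the transcript's sense strand,
    so unaligned (soft-clipped) bases at the read's end are candidates for a
    non-templated poly(A) tail.
    """
    # A trailing "<n>S" in the CIGAR string marks n soft-clipped bases at the 3' end.
    match = re.search(r"(\d+)S$", cigar)
    if match is None:
        return None
    clip_len = int(match.group(1))
    if clip_len < min_len:
        return None
    tail = read_seq[-clip_len:]
    # Tolerate a few sequencing errors by requiring only a high fraction of A's.
    a_frac = tail.count("A") / clip_len
    return tail if a_frac >= min_a_frac else None


print(softclipped_polya_tail("ACGTACGTAAAAAAAA", "8M8S"))
print(softclipped_polya_tail("ACGTACGTACGTACGT", "8M8S"))
```

Because the tail is soft-clipped rather than aligned, it is not encoded in the genome at that position, which is what distinguishes it from an internal A-rich stretch.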
Affiliation(s)
- Youngbin Moon
- Computational and Systems Biology, Biozentrum University of Basel, Spitalstrasse 41, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Dominik Burri
- Computational and Systems Biology, Biozentrum University of Basel, Spitalstrasse 41, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Mihaela Zavolan
- Computational and Systems Biology, Biozentrum University of Basel, Spitalstrasse 41, CH-4056 Basel, Switzerland
- Swiss Institute of Bioinformatics, Basel, Switzerland
| |
|
12
|
Hao S, Zhang L, Zhao D, Zhou J, Ye C, Qu H, Li QQ. Inhibitor AN3661 reveals biological functions of Arabidopsis CLEAVAGE and POLYADENYLATION SPECIFICITY FACTOR 73. PLANT PHYSIOLOGY 2023; 193:537-554. [PMID: 37335917 DOI: 10.1093/plphys/kiad352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2023] [Revised: 05/09/2023] [Accepted: 05/21/2023] [Indexed: 06/21/2023]
Abstract
Cleavage and polyadenylation specificity factor (CPSF) is a protein complex that plays an essential biochemical role in mRNA 3'-end formation, including poly(A) signal recognition and cleavage at the poly(A) site. However, its biological functions at the organismal level are mostly unknown in multicellular eukaryotes. The study of plant CPSF73 has been hampered by the lethality of Arabidopsis (Arabidopsis thaliana) homozygous mutants of AtCPSF73-I and AtCPSF73-II. Here, we used poly(A) tag sequencing to investigate the roles of AtCPSF73-I and AtCPSF73-II in Arabidopsis treated with AN3661, an antimalarial drug with specificity for parasite CPSF73 that is homologous to plant CPSF73. Direct seed germination on an AN3661-containing medium was lethal; however, 7-d-old seedlings treated with AN3661 survived. AN3661 targeted AtCPSF73-I and AtCPSF73-II, inhibiting growth through coordinating gene expression and poly(A) site choice. Functional enrichment analysis revealed that the accumulation of ethylene and auxin jointly inhibited primary root growth. AN3661 affected poly(A) signal recognition, resulted in lower U-rich signal usage, caused transcriptional readthrough, and increased the distal poly(A) site usage. Many microRNA targets were found in the transcripts with lengthened 3' untranslated regions; these miRNAs may indirectly regulate the expression of these targets. Overall, this work demonstrates that AtCPSF73 plays an important part in co-transcriptional regulation, affecting growth and development in Arabidopsis.
Affiliation(s)
- Saiqi Hao
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
| | - Lidan Zhang
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
| | - Danhui Zhao
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
| | - Jiawen Zhou
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
| | - Congting Ye
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
| | - Haidong Qu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
| | - Qingshun Q Li
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystem, College of the Environment and Ecology, Xiamen University, Xiamen, Fujian 361102, China
- Biomedical Sciences, College of Dental Medicine, Western University of Health Sciences, Pomona, CA 91766, USA
| |
|
13
|
Verwilt J, Mestdagh P, Vandesompele J. Artifacts and biases of the reverse transcription reaction in RNA sequencing. RNA (NEW YORK, N.Y.) 2023; 29:889-897. [PMID: 36990512 DOI: 10.1261/rna.079623.123] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Indexed: 06/18/2023]
Abstract
RNA sequencing has spurred a significant number of research areas in recent years. Most protocols rely on synthesizing a more stable complementary DNA (cDNA) copy of the RNA molecule during the reverse transcription reaction. The resulting cDNA pool is often wrongfully assumed to be quantitatively and molecularly similar to the original RNA input. Sadly, biases and artifacts confound the resulting cDNA mixture. These issues are often overlooked or ignored in the literature by those that rely on the reverse transcription process. In this review, we confront the reader with intra- and intersample biases and artifacts caused by the reverse transcription reaction during RNA sequencing experiments. To fight the reader's despair, we also provide solutions to most issues and inform on good RNA sequencing practices. We hope the reader can use this review to their advantage, thereby contributing to scientifically sound RNA studies.
Affiliation(s)
- Jasper Verwilt
- OncoRNALab, Cancer Research Institute Ghent, 9000 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Center for Medical Genetics, Ghent University, 9000 Ghent, Belgium
| | - Pieter Mestdagh
- OncoRNALab, Cancer Research Institute Ghent, 9000 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Center for Medical Genetics, Ghent University, 9000 Ghent, Belgium
| | - Jo Vandesompele
- OncoRNALab, Cancer Research Institute Ghent, 9000 Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Center for Medical Genetics, Ghent University, 9000 Ghent, Belgium
| |
|
14
|
Abstract
Formation of the 3' end of a eukaryotic mRNA is a key step in the production of a mature transcript. This process is mediated by a number of protein factors that cleave the pre-mRNA, add a poly(A) tail, and regulate transcription by protein dephosphorylation. Cleavage and polyadenylation specificity factor (CPSF) in humans, or cleavage and polyadenylation factor (CPF) in yeast, coordinates these enzymatic activities with each other, with RNA recognition, and with transcription. The site of pre-mRNA cleavage can strongly influence the translation, stability, and localization of the mRNA. Hence, cleavage site selection is highly regulated. The length of the poly(A) tail is also controlled to ensure that every transcript has a similar tail when it is exported from the nucleus. In this review, we summarize new mechanistic insights into mRNA 3'-end processing obtained through structural studies and biochemical reconstitution and outline outstanding questions in the field.
Affiliation(s)
- Vytautė Boreikaitė
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
| | - Lori A Passmore
- Medical Research Council Laboratory of Molecular Biology, Cambridge, United Kingdom
| |
|
15
|
Pardo-Palacios FJ, Arzalluz-Luque A, Kondratova L, Salguero P, Mestre-Tomás J, Amorín R, Estevan-Morió E, Liu T, Nanni A, McIntyre L, Tseng E, Conesa A. SQANTI3: curation of long-read transcriptomes for accurate identification of known and novel isoforms. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.17.541248. [PMID: 37398077 PMCID: PMC10312485 DOI: 10.1101/2023.05.17.541248] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Indexed: 07/04/2023]
Abstract
The emergence of long-read RNA sequencing (lrRNA-seq) has provided an unprecedented opportunity to analyze transcriptomes at isoform resolution. However, the technology is not free from biases, and transcript models inferred from these data require quality control and curation. In this study, we introduce SQANTI3, a tool specifically designed to perform quality analysis on transcriptomes constructed using lrRNA-seq data. SQANTI3 provides an extensive naming framework to describe transcript model diversity in comparison to the reference transcriptome. Additionally, the tool incorporates a wide range of metrics to characterize various structural properties of transcript models, such as transcription start and end sites, splice junctions, and other structural features. These metrics can be utilized to filter out potential artifacts. Moreover, SQANTI3 includes a Rescue module that prevents the loss of known genes and transcripts exhibiting evidence of expression but displaying low-quality features. Lastly, SQANTI3 incorporates IsoAnnotLite, which enables functional annotation at the isoform level and facilitates functional iso-transcriptomics analyses. We demonstrate the versatility of SQANTI3 in analyzing different data types, isoform reconstruction pipelines, and sequencing platforms, and how it provides novel biological insights into isoform biology. The SQANTI3 software is available at https://github.com/ConesaLab/SQANTI3 .
|
16
|
Masamha CP. The emerging roles of CFIm25 (NUDT21/CPSF5) in human biology and disease. WILEY INTERDISCIPLINARY REVIEWS. RNA 2023; 14:e1757. [PMID: 35965101 PMCID: PMC9925614 DOI: 10.1002/wrna.1757] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 05/18/2022] [Revised: 07/07/2022] [Accepted: 07/11/2022] [Indexed: 11/11/2022]
Abstract
The mammalian cleavage factor I subunit CFIm25 (NUDT21) binds to the UGUA sequences of precursor RNAs. Traditionally, CFIm25 is known to facilitate 3' end formation of pre-mRNAs resulting in the formation of polyadenylated transcripts. Recent studies suggest that CFIm25 may be involved in the cyclization and hence generation of circular RNAs (circRNAs) that contain UGUA motifs. These circRNAs act as competing endogenous RNAs (ceRNAs) that disrupt the ceRNA-miRNA-mRNA axis. Other emerging roles of CFIm25 include regulating both alternative splicing and alternative polyadenylation (APA). APA generates differently sized transcripts that may code for different proteins, or more commonly transcripts that code for the same protein but differ in the length and sequence content of their 3' UTRs (3' UTR-APA). CFIm25-mediated global changes in 3' UTR-APA affect human physiology including spermatogenesis and the determination of cell fate. Deregulation of CFIm25 and changes in 3' UTR-APA have been implicated in several human diseases including cancer. In many cancers, CFIm25 acts as a tumor suppressor. However, there are some cancers where CFIm25 has the opposite effect. Alterations in CFIm25-driven 3' UTR-APA may also play a role in neural dysfunction and fibrosis. CFIm25-mediated 3' UTR-APA changes can be used to generate specific signatures that can serve as potential biomarkers in development and disease. Due to the emerging role of CFIm25 as a regulator of the aforementioned RNA processing events, modulation of CFIm25 levels may be a novel viable therapeutic approach. This article is categorized under: RNA Processing > 3' End Processing; RNA in Disease and Development > RNA in Disease.
Affiliation(s)
- Chioniso Patience Masamha
- Department of Pharmaceutical Sciences, College of Pharmacy and Health Sciences, Butler University, Indianapolis, Indiana, USA
| |
|
17
|
de Felippes FF, Waterhouse PM. Plant terminators: the unsung heroes of gene expression. JOURNAL OF EXPERIMENTAL BOTANY 2023; 74:2239-2250. [PMID: 36477559 PMCID: PMC10082929 DOI: 10.1093/jxb/erac467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/21/2022] [Accepted: 11/25/2022] [Indexed: 06/06/2023]
Abstract
To be properly expressed, genes need to be accompanied by a terminator, a region downstream of the coding sequence that contains the information necessary for the maturation of the mRNA 3' end. The main event in this process is the addition of a poly(A) tail at the 3' end of the new transcript, a critical step in mRNA biology that has important consequences for the expression of genes. Here, we review the mechanism leading to cleavage and polyadenylation of newly transcribed mRNAs and how this process can affect the final levels of gene expression. We give special attention to an aspect often overlooked, the effect that different terminators can have on the expression of genes. We also discuss some exciting findings connecting the choice of terminator to the biogenesis of small RNAs, which are a central part of one of the most important mechanisms of regulation of gene expression in plants.
Affiliation(s)
| | - Peter M Waterhouse
- Centre for Agriculture and the Bioeconomy, Institute for Future Environments, Queensland University of Technology (QUT), Brisbane, QLD, Australia
- ARC Centre of Excellence for Plant Success in Nature & Agriculture, QUT, Brisbane, QLD, Australia
| |
|
18
|
Rivera-Rivas LA, Arroyo R. Iron restriction increases the expression of a cytotoxic cysteine proteinase TvCP2 by a novel mechanism of tvcp2 mRNA alternative polyadenylation in Trichomonas vaginalis. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2023; 1866:194935. [PMID: 37011833 DOI: 10.1016/j.bbagrm.2023.194935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2022] [Revised: 03/25/2023] [Accepted: 03/27/2023] [Indexed: 04/05/2023]
Abstract
Trichomonas vaginalis TvCP2 (TVAG_057000) is a cytotoxic cysteine proteinase (CP) expressed under iron-limited conditions. This work aimed to identify one of the mechanisms of tvcp2 gene expression regulation by iron at the posttranscriptional level. We checked tvcp2 mRNA stability under both iron-restricted (IR) and high iron (HI) conditions in the presence of actinomycin D. As expected, the tvcp2 mRNA was more stable under IR than under HI conditions. In silico analysis of the 3' regulatory region showed the presence of two putative polyadenylation signals in the tvcp2 transcript. By 3'-RACE assays, we demonstrated the existence of two isoforms of the tvcp2 mRNA with different 3'-UTRs that resulted in more TvCP2 protein under IR than under HI conditions, as detected by western blot (WB) assays. Additionally, we searched for homologs of the trichomonad polyadenylation machinery by an in silico analysis in the genome database, TrichDB. Sixteen genes that encode proteins that could be part of the trichomonad polyadenylation machinery were found. qRT-PCR assays showed that most of these genes were positively regulated by iron. Thus, our results show the presence of alternative polyadenylation as a novel iron posttranscriptional regulatory mechanism in T. vaginalis for tvcp2 gene expression.
Affiliation(s)
- Luis Alberto Rivera-Rivas
- Departamento de Infectómica y Patogénesis Molecular, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Mexico City, Mexico
| | - Rossana Arroyo
- Departamento de Infectómica y Patogénesis Molecular, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional (CINVESTAV-IPN), Mexico City, Mexico
| |
|
19
|
Sharma H, Pani T, Dasgupta U, Batra J, Sharma RD. Prediction of transcript structure and concentration using RNA-Seq data. Brief Bioinform 2023; 24:6995379. [PMID: 36682028 DOI: 10.1093/bib/bbad022] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 10/03/2022] [Revised: 11/25/2022] [Accepted: 01/06/2023] [Indexed: 01/23/2023] Open
Abstract
Alternative splicing (AS) is a key post-transcriptional modification that helps in increasing protein diversity. Almost 90% of the protein-coding genes in humans are known to undergo AS and code for different transcripts. Some transcripts are associated with diseases such as breast cancer, lung cancer and glioblastoma. Hence, these transcripts can serve as novel therapeutic and prognostic targets for drug discovery. Herein, we have developed a pipeline, Finding Alternative Splicing Events (FASE), as an R package that includes modules to determine the structure and concentration of transcripts using differential AS. To predict the correct structure of expressed transcripts in given conditions, FASE combines the AS events with the information of exons, introns and junctions using graph theory. The estimated concentration of predicted transcripts is reported as the relative expression in terms of log2CPM. Using FASE, we were able to identify several unique transcripts of EMILIN1 and SLK genes in the TCGA-BRCA data, which were validated using RT-PCR. The experimental study demonstrated consistent results, which signify the high accuracy and precision of the developed methods. In conclusion, the developed pipeline, FASE, can efficiently predict novel transcripts that are missed in general transcript-level differential expression analysis. It can be applied selectively, from a single gene to simple or complex genomes, even under multiple experimental conditions, for the identification of differential AS-based biomarkers, prognostic targets and novel therapeutics.
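For readers unfamiliar with the log2CPM unit mentioned in the abstract, the following minimal sketch shows one common way to compute log2 counts per million. The abstract does not give FASE's exact formula, so the function name, the dictionary input, and the pseudocount added to avoid taking the logarithm of zero are assumptions for illustration only.

```python
import math


def log2_cpm(counts, pseudocount=1.0):
    """Convert raw read counts per transcript to log2 counts per million (CPM).

    Hypothetical helper: `counts` maps transcript names to raw read counts.
    The pseudocount is added to each CPM value so that zero counts stay finite;
    it is an illustrative choice, not a value taken from FASE.
    """
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("library size must be positive")
    return {
        name: math.log2(count * 1_000_000 / total + pseudocount)
        for name, count in counts.items()
    }


# Example with made-up counts for two transcripts of one library.
print(log2_cpm({"transcript_A": 300, "transcript_B": 100}))
```

Scaling by library size makes values comparable between samples sequenced to different depths, and the log2 scale turns fold changes into simple differences.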
Affiliation(s)
- Harsh Sharma
- Amity Institute of Integrative Sciences and Health, Amity University Haryana, Gurugram 122413, India
| | - Trishna Pani
- Amity Institute of Integrative Sciences and Health, Amity University Haryana, Gurugram 122413, India
| | - Ujjaini Dasgupta
- Amity Institute of Integrative Sciences and Health, Amity University Haryana, Gurugram 122413, India
| | - Jyotsna Batra
- School of Biomedical Sciences, Institute of Health and Biomedical Innovation (IHBI), Translational Research Institute, Queensland University of Technology (QUT), Brisbane, QLD, Australia
| | - Ravi Datta Sharma
- Amity Institute of Integrative Sciences and Health, Amity University Haryana, Gurugram 122413, India
| |
|
20
|
Freire RP, Hernandez-Gonzalez JE, Lima ER, Suzuki MF, de Oliveira JE, Torai LS, Bartolini P, Soares CRJ. Molecular Cloning and AlphaFold Modeling of Thyrotropin (ag-TSH) From the Amazonian Fish Pirarucu (Arapaima gigas). Bioinform Biol Insights 2023; 17:11779322231154148. [PMID: 36798082 PMCID: PMC9926385 DOI: 10.1177/11779322231154148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/11/2022] [Accepted: 01/14/2023] [Indexed: 02/17/2023] Open
Abstract
Arapaima gigas, known as Pirarucu in Brazil, is one of the largest freshwater fish in the world. Some individuals can reach 3 m in length and weigh up to 200 kg. Due to extinction risks and its economic value, the species has been a focus for preservation and reproduction studies. Thyrotropin (TSH) is a glycoprotein hormone formed by 2 subunits, α and β, whose main activity is related to the synthesis of the thyroid hormones (THs) T3 and T4. In this work, we present a combination of bioinformatics tools to identify Arapaima gigas βTSH (ag-βTSH), model its molecular structure, and express the recombinant heterodimer form in mammalian cells. Computational biology based on genome-related information, combined with in silico molecular cloning and modeling, allowed the ag-βTSH sequence to be confirmed by reverse transcriptase-polymerase chain reaction (RT-PCR) and transient expression in human embryonic kidney (HEK293F) cells. Molecular cloning of ag-βTSH retrieved 146 amino acids with a signal peptide of 21 amino acid residues and 6 disulfide bonds. The sequence shows similarity to those of 39 fish species, ranging between 43.1% and 81.6%, with extremely conserved domains such as the cystine knot motif and the N-glycosylation site. The Arapaima gigas thyrotropin (ag-TSH) model, solved by AlphaFold, was used in molecular dynamics simulations with the Scleropages formosus receptor, providing similar values of free energy ΔGbind and ΔGPMF in comparison with the Homo sapiens model. The recombinant expression in HEK293F cells reached a yield of 25 mg/L, characterized via chromatographic and physical-chemical techniques. This work shows that other Arapaima gigas proteins could be studied in a similar way, using the combination of these techniques, recovering more information from its genome and improving the reproduction and preservation of this prehistoric fish.
Affiliation(s)
- Renan Passos Freire
- Instituto de Pesquisas Energéticas e Nucleares (IPEN-CNEN), São Paulo, Brazil
| | - Jorge Enrique Hernandez-Gonzalez
- Instituto de Biociências, Letras e Ciências Exatas (IBILCE), Universidade Estadual Paulista “Júlio de Mesquita Filho” (UNESP), São Paulo, Brazil
| | - Eliana Rosa Lima
- Instituto de Pesquisas Energéticas e Nucleares (IPEN-CNEN), São Paulo, Brazil
| | | | | | | | - Paolo Bartolini
- Instituto de Pesquisas Energéticas e Nucleares (IPEN-CNEN), São Paulo, Brazil
| | - Carlos Roberto Jorge Soares
- Instituto de Pesquisas Energéticas e Nucleares (IPEN-CNEN), São Paulo, Brazil
- Carlos Roberto Jorge Soares, Biotechnology Center, Instituto de Pesquisas Energéticas e Nucleares (IPEN-CNEN), Av. Prof. Lineu Prestes 2242, Cidade Universitária, São Paulo SP 05508-000, Brazil.
| |
|
21
|
Slight Variations in the Sequence Downstream of the Polyadenylation Signal Significantly Increase Transgene Expression in HEK293T and CHO Cells. Int J Mol Sci 2022; 23:ijms232415485. [PMID: 36555130 PMCID: PMC9779314 DOI: 10.3390/ijms232415485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/14/2022] [Revised: 11/28/2022] [Accepted: 12/01/2022] [Indexed: 12/13/2022] Open
Abstract
Compared to transcription initiation, much less is known about transcription termination. In particular, large-scale mutagenesis studies have, so far, primarily concentrated on promoter and enhancer, but not terminator sequences. Here, we used a massively parallel reporter assay (MPRA) to systematically analyze the influence of short (8 bp) sequence variants (mutations) located downstream of the polyadenylation signal (PAS) on the steady-state mRNA level of the upstream gene, employing an eGFP reporter and human HEK293T cells as a model system. In total, we evaluated 227,755 mutations located at different overlapping positions within +17..+56 bp downstream of the PAS for their ability to regulate the reporter gene expression. We found that the positions +17..+44 bp downstream of the PAS are more essential for gene upregulation than those located more distal to the PAS, and that the mutation sequences ensuring high levels of eGFP mRNA expression are extremely T-rich. Next, we validated the positive effect of a couple of mutations identified in the MPRA screening on the eGFP and luciferase protein expression. The most promising mutation increased the expression of the reporter proteins 13-fold and sevenfold on average in HEK293T and CHO cells, respectively. Overall, these findings might be useful for further improving the efficiency of production of therapeutic products, e.g., recombinant antibodies.
|
22
|
Hadar S, Meller A, Saida N, Shalgi R. Stress-induced transcriptional readthrough into neighboring genes is linked to intron retention. iScience 2022; 25:105543. [PMID: 36505935 PMCID: PMC9732411 DOI: 10.1016/j.isci.2022.105543] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/16/2022] [Revised: 07/10/2022] [Accepted: 11/07/2022] [Indexed: 11/11/2022] Open
Abstract
Exposure to certain stresses leads to readthrough transcription. Using polyA-selected RNA-seq in mouse fibroblasts subjected to heat shock, oxidative, or osmotic stress, we found that readthrough transcription can proceed into proximal downstream genes, in a phenomenon previously termed "read-in." We found that read-in genes share distinctive genomic characteristics; they are GC-rich and extremely short, with genomic features conserved in humans. Using ribosome profiling, we found that read-in genes show significantly reduced translation. Strikingly, read-in genes demonstrate marked intron retention, mostly in their first introns, which could not be explained solely by their short introns and GC-richness, features often associated with intron retention. Finally, we revealed H3K36me3 enrichment upstream of read-in genes. Moreover, demarcation of exon-intron junctions by H3K36me3 was absent in read-in first introns. Our data portray a relationship between read-in and intron retention, suggesting they may have co-evolved to facilitate reduced translation of read-in genes during stress.
Affiliation(s)
- Shani Hadar
- Department of Biochemistry, Rappaport Faculty of Medicine, Technion–Israel Institute of Technology, Haifa 31096, Israel
| | - Anatoly Meller
- Department of Biochemistry, Rappaport Faculty of Medicine, Technion–Israel Institute of Technology, Haifa 31096, Israel
| | - Naseeb Saida
- Department of Biochemistry, Rappaport Faculty of Medicine, Technion–Israel Institute of Technology, Haifa 31096, Israel
| | - Reut Shalgi
- Department of Biochemistry, Rappaport Faculty of Medicine, Technion–Israel Institute of Technology, Haifa 31096, Israel (corresponding author)
| |
|
23
|
Gutierrez PA, Wei J, Sun Y, Tong L. Molecular basis for the recognition of the AUUAAA polyadenylation signal by mPSF. RNA (NEW YORK, N.Y.) 2022; 28:1534-1541. [PMID: 36130077 PMCID: PMC9745836 DOI: 10.1261/rna.079322.122] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 06/19/2022] [Accepted: 08/22/2022] [Indexed: 06/15/2023]
Abstract
The polyadenylation signal (PAS) is a key sequence element for 3'-end cleavage and polyadenylation of messenger RNA precursors (pre-mRNAs). This hexanucleotide motif is recognized by the mammalian polyadenylation specificity factor (mPSF), consisting of CPSF160, WDR33, CPSF30, and Fip1 subunits. Recent studies have revealed how the AAUAAA PAS, the most frequently observed PAS, is recognized by mPSF. We report here the structure of human mPSF in complex with the AUUAAA PAS, the second most frequently identified PAS. Conformational differences are observed for the A1 and U2 nucleotides in AUUAAA compared to the A1 and A2 nucleotides in AAUAAA, while the binding modes of the remaining 4 nt are essentially identical. The 5' phosphate of U2 moves by 2.6 Å and the U2 base is placed near the six-membered ring of A2 in AAUAAA, where it makes two hydrogen bonds with zinc finger 2 (ZF2) of CPSF30, which undergoes conformational changes as well. We also attempted to determine the binding modes of two rare PAS hexamers, AAGAAA and GAUAAA, but did not observe the RNA in the cryo-electron microscopy density. The residues in CPSF30 (ZF2 and ZF3) and WDR33 that recognize PAS are disordered in these two structures.
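As an illustration of what a PAS hexamer looks like at the sequence level, the sketch below scans an RNA (or DNA) string for the two hexamers the abstract names as most frequent, AAUAAA and AUUAAA. The function name and the choice to report 0-based start positions are assumptions made for this example; it is a plain string search and says nothing about whether mPSF would actually bind a given site.

```python
# The two most frequently observed PAS hexamers, per the abstract above.
PAS_HEXAMERS = ("AAUAAA", "AUUAAA")


def find_pas_hexamers(sequence, hexamers=PAS_HEXAMERS):
    """Return (0-based start, hexamer) for every PAS hexamer found in `sequence`.

    Hypothetical helper: accepts RNA or DNA in either case; T is converted
    to U so that genomic sequences can be scanned with the RNA motifs.
    """
    rna = sequence.upper().replace("T", "U")
    hits = []
    # Slide a 6-nt window over the sequence; overlapping hits are all reported.
    for start in range(len(rna) - 5):
        window = rna[start:start + 6]
        if window in hexamers:
            hits.append((start, window))
    return hits


print(find_pas_hexamers("GGAAUAAACCAUUAAAGG"))
# prints [(2, 'AAUAAA'), (10, 'AUUAAA')]
```

The two hexamers differ only at the second nucleotide, which is the position where the abstract reports conformational differences in the mPSF complex.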
Affiliation(s)
- Pedro A Gutierrez
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | - Jia Wei
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | - Yadong Sun
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| | - Liang Tong
- Department of Biological Sciences, Columbia University, New York, New York 10027, USA
| |
|
24
|
Liu J, Lu X, Zhang S, Yuan L, Sun Y. Molecular Insights into mRNA Polyadenylation and Deadenylation. Int J Mol Sci 2022; 23:ijms231910985. [PMID: 36232288 PMCID: PMC9570436 DOI: 10.3390/ijms231910985] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/15/2022] [Revised: 08/31/2022] [Accepted: 09/06/2022] [Indexed: 11/28/2022] Open
Abstract
Poly(A) tails are present on almost all eukaryotic mRNAs, and play critical roles in mRNA stability, nuclear export, and translation efficiency. The biosynthesis and shortening of a poly(A) tail are regulated by large multiprotein complexes. However, the molecular mechanisms of these protein machineries still remain unclear. Recent studies regarding the structural and biochemical characteristics of those protein complexes have shed light on the potential mechanisms of polyadenylation and deadenylation. This review summarizes the recent structural studies on pre-mRNA 3′-end processing complexes that initiate the polyadenylation and discusses the similarities and differences between yeast and human machineries. Specifically, we highlight recent biochemical efforts in the reconstitution of the active human canonical pre-mRNA 3′-end processing systems, as well as the roles of RBBP6/Mpe1 in activating the entire machinery. We also describe how poly(A) tails are removed by the PAN2-PAN3 and CCR4-NOT deadenylation complexes and discuss the emerging role of the cytoplasmic poly(A)-binding protein (PABPC) in promoting deadenylation. Together, these recent discoveries show that the dynamic features of these machineries play important roles in regulating polyadenylation and deadenylation.
|
25
|
Duran-Arqué B, Cañete M, Castellazzi CL, Bartomeu A, Ferrer-Caelles A, Reina O, Caballé A, Gay M, Arauz-Garofalo G, Belloc E, Mendez R. Comparative analyses of vertebrate CPEB proteins define two subfamilies with coordinated yet distinct functions in post-transcriptional gene regulation. Genome Biol 2022; 23:192. [PMID: 36096799 PMCID: PMC9465852 DOI: 10.1186/s13059-022-02759-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 12/14/2021] [Accepted: 08/12/2022] [Indexed: 12/31/2022] Open
Abstract
Background: Vertebrate CPEB proteins bind mRNAs at cytoplasmic polyadenylation elements (CPEs) in their 3′ UTRs, leading to cytoplasmic changes in their poly(A) tail lengths; this can promote translational repression or activation of the mRNA. However, neither the regulation nor the mechanisms of action of the CPEB family per se have been systematically addressed to date. Results: Based on a comparative analysis of the four vertebrate CPEBs, we determine their differential regulation by phosphorylation, the composition and properties of their supramolecular assemblies, and their target mRNAs. We show that all four CPEBs are able to recruit the CCR4-NOT deadenylation complex to repress translation. However, their regulation, mechanism of action, and target mRNAs define two subfamilies. Thus, CPEB1 forms ribonucleoprotein complexes that are remodeled upon a single phosphorylation event and are associated with mRNAs containing canonical CPEs. CPEB2–4 are regulated by multiple proline-directed phosphorylations that control their liquid–liquid phase separation. CPEB2–4 mRNA targets include CPEB1-bound transcripts, with canonical CPEs, but also a specific subset of mRNAs with non-canonical CPEs. Conclusions: Altogether, these results show how, globally, the CPEB family of proteins is able to integrate cellular cues to generate a fine-tuned adaptive response in gene expression regulation through the coordinated actions of all four members. Supplementary Information: The online version contains supplementary material available at 10.1186/s13059-022-02759-y.
Affiliation(s)
- Berta Duran-Arqué
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Manuel Cañete
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Chiara Lara Castellazzi
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Anna Bartomeu
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Anna Ferrer-Caelles
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Oscar Reina
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Adrià Caballé
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Marina Gay
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Gianluca Arauz-Garofalo
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Eulalia Belloc
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
| | - Raúl Mendez
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, 08028, Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), 08010, Barcelona, Spain
| |
|
26
|
Fang L, Guo L, Zhang M, Li X, Deng Z. Analysis of Polyadenylation Signal Usage with Full-Length Transcriptome in Spodoptera frugiperda (Lepidoptera: Noctuidae). Insects 2022; 13:803. [PMID: 36135504 PMCID: PMC9505298 DOI: 10.3390/insects13090803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/01/2022] [Revised: 08/29/2022] [Accepted: 08/30/2022] [Indexed: 06/16/2023]
Abstract
During the messenger RNA (mRNA) maturation process, RNA polyadenylation is a key step, and is coupled to the termination of transcription. Various cis-acting elements near the cleavage site and their binding factors affect the process of polyadenylation, and AAUAAA, a highly conserved hexamer, is the most important polyadenylation signal (PAS). PAS usage is a critical determinant of post-transcriptional mRNA modification. Full-length transcriptome sequencing has recently generated a massive amount of data, revealing poly(A) variation and alternative polyadenylation (APA) in Spodoptera frugiperda. We identified 50,616 polyadenylation signals in Spodoptera frugiperda via analysis of the full-length transcriptome combined with expressed sequence tag (EST) technology. The polyadenylation signal usage in Spodoptera frugiperda is conserved, and it is similar to that of flies and other animals. AAUAAA and AUUAAA are the most highly conserved of all the polyadenylation signals we identified. Additionally, we found the U/GU-rich downstream sequence element (DSE) in the cleavage site. These results demonstrate that APA in Spodoptera frugiperda plays a significant role in root growth and development. This is the first polyadenylation signal usage analysis in agricultural pests, which can deepen our understanding of Spodoptera frugiperda and provide a theoretical basis for pest control.
Affiliation(s)
- Liying Fang
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China
| | - Lina Guo
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China
| | - Min Zhang
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China
| | - Xianchun Li
- Department of Entomology, BIO5 Institute, University of Arizona, Tucson, AZ 85721, USA
| | - Zhongyuan Deng
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China
| |
|
27
|
Saldanha PA, Bolanle IO, Palmer TM, Nikitenko LL, Rivero F. Complex Transcriptional Profiles of the PPP1R12A Gene in Cells of the Circulatory System as Revealed by In Silico Analysis and Reverse Transcription PCR. Cells 2022; 11:cells11152315. [PMID: 35954160 PMCID: PMC9367544 DOI: 10.3390/cells11152315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/01/2022] [Revised: 07/22/2022] [Accepted: 07/25/2022] [Indexed: 11/16/2022] Open
Abstract
The myosin light chain phosphatase target subunit 1 (MYPT1), encoded by the PPP1R12A gene, is a key component of the myosin light chain phosphatase (MLCP) protein complex. MYPT1 isoforms have been described as products of the cassette-type alternative splicing of exons E13, E14, E22, and E24. Through in silico analysis of the publicly available EST and mRNA databases, we established that PPP1R12A contains 32 exons (6 more than the 26 previously reported), of which 29 are used in 11 protein-coding transcripts. An in silico analysis of publicly available RNAseq data combined with validation by reverse transcription (RT)-PCR allowed us to determine the relative abundance of each transcript in three cell types of the circulatory system where MYPT1 plays important roles: human umbilical vein endothelial cells (HUVEC), human saphenous vein smooth muscle cells (HSVSMC), and platelets. All three cell types express up to 10 transcripts at variable frequencies. HUVECs and HSVSMCs predominantly express the full-length variant (58.3% and 64.3%, respectively) followed by the variant skipping E13 (33.7% and 23.1%, respectively), whereas in platelets the predominant variants are those skipping E14 (51.4%) and E13 (19.9%), followed by the full-length variant (14.4%). Variants including E24 account for 5.4% of transcripts in platelets but are rare (<1%) in HUVECs and HSVSMCs. Complex transcriptional profiles were also found across organs using in silico analysis of RNAseq data from the GTEx project. Our findings provide a platform for future studies investigating the specific (patho)physiological roles of understudied MYPT1 isoforms.
|
28
|
Miyoshi K, Hagita H, Horiguchi T, Tanimura A, Noma T. Redefining GBA gene structure unveils the ability of Cap-independent, IRES-dependent gene regulation. Commun Biol 2022; 5:639. [PMID: 35831491 PMCID: PMC9279297 DOI: 10.1038/s42003-022-03577-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/10/2020] [Accepted: 06/10/2022] [Indexed: 11/09/2022] Open
Abstract
Glucosylceramide is the primary molecule of glycosphingolipids, and its metabolic regulation is crucial for life. Defects in the catabolizing enzyme, glucocerebrosidase (GCase), cause a lysosomal storage disorder known as Gaucher disease. However, the genetic regulation of GCase is not fully understood. Here we show the redefined structure of the GCase coding gene (GBA), and clarify the regulatory mechanisms of its transcription and translation. First, alternative uses of the two GBA gene promoters were identified in fibroblasts and HL60-derived macrophages. Intriguingly, both GBA transcripts and GCase activities were induced in macrophages but not in neutrophils. Second, we observed that cap-independent translation occurs via unique internal ribosome entry site activities in first promoter-driven GBA transcripts. Third, reciprocal expression was observed in GBA and miR22-3p versus GBAP1 transcripts before and after HL60-induced macrophage differentiation. Together, these findings demonstrate novel cell-type-specific GBA gene expression regulatory mechanisms, providing new insights into GCase biology. The cell type-specific expression of the glucocerebrosidase gene, associated with the lysosomal storage disorder called Gaucher disease, is linked to cis- and trans-regulatory transcriptional and translational mechanisms.
Affiliation(s)
- Keiko Miyoshi
- Department of Oral Bioscience, Tokushima University Graduate School of Biomedical Sciences, Tokushima, 770-8504, Japan.
| | - Hiroko Hagita
- Department of Oral Bioscience, Tokushima University Graduate School of Biomedical Sciences, Tokushima, 770-8504, Japan
| | - Taigo Horiguchi
- Department of Oral Bioscience, Tokushima University Graduate School of Biomedical Sciences, Tokushima, 770-8504, Japan
| | - Ayako Tanimura
- Division of Food & Health Sciences, Department of Environmental and Symbiotic Sciences, Faculty of Environmental and Symbiotic Sciences, Prefectural University of Kumamoto, Kumamoto, 862-8502, Japan
| | - Takafumi Noma
- Department of Nutrition and Health Promotion, Faculty of Human Life Studies, Hiroshima Jogakuin University, 4-13-1 Ushita-higashi, Higashi-ku, Hiroshima, 732-0063, Japan
| |
|
29
|
Zheng Y, Li X, Jiao Y, Wu C. High-Risk Human Papillomavirus Oncogenic E6/E7 mRNAs Splicing Regulation. Front Cell Infect Microbiol 2022; 12:929666. [PMID: 35832386 PMCID: PMC9271614 DOI: 10.3389/fcimb.2022.929666] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 04/27/2022] [Accepted: 05/19/2022] [Indexed: 11/22/2022] Open
Abstract
High-risk human papillomavirus infection may develop into a persistent infection that is highly related to the progression of various cancers, including cervical cancer and head and neck squamous cell carcinomas. The most common high-risk subtypes are HPV16 and HPV18. The oncogenic viral proteins E6 and E7 expressed by high-risk HPVs are tightly involved in cell proliferation, differentiation, and cancerous transformation. Because E6/E7 mRNAs are derived from the same pre-mRNA, alternative splicing in the E6/E7-coding region affects the balance of E6/E7 expression levels. Interrupting the balance of E6 and E7 levels results in cell apoptosis. Therefore, it is crucial to understand the regulation of E6/E7 splice site selection and the interaction of splicing enhancers and silencers with cellular splicing factors. In this review, we summarize the relationship of different E6/E7 transcripts with cancer progression, the known splicing sites, and the identified cis-regulatory elements within the high-risk HPV E6/E7-coding region. Finally, we also review the role of various splicing factors in the regulation of high-risk HPV oncogenic E6/E7 mRNA splicing.
Affiliation(s)
- Yunji Zheng
- School of Pharmacy, Binzhou Medical University, Yantai, China
| | - Xue Li
- School of Pharmacy, Binzhou Medical University, Yantai, China
| | - Yisheng Jiao
- School of Biomedical Engineering, Dalian University of Technology, Dalian, China
| | - Chengjun Wu
- School of Biomedical Engineering, Dalian University of Technology, Dalian, China
- Correspondence: Chengjun Wu
| |
|
30
|
Kwon B, Fansler MM, Patel ND, Lee J, Ma W, Mayr C. Enhancers regulate 3' end processing activity to control expression of alternative 3'UTR isoforms. Nat Commun 2022; 13:2709. [PMID: 35581194 PMCID: PMC9114392 DOI: 10.1038/s41467-022-30525-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Received: 07/30/2021] [Accepted: 05/02/2022] [Indexed: 12/12/2022] Open
Abstract
Multi-UTR genes are widely transcribed and express their alternative 3'UTR isoforms in a cell type-specific manner. As transcriptional enhancers regulate mRNA expression, we investigated if they also regulate 3'UTR isoform expression. Endogenous enhancer deletion of the multi-UTR gene PTEN did not impair transcript production but prevented 3'UTR isoform switching, which was recapitulated by silencing of an enhancer-bound transcription factor. In reporter assays, enhancers increase transcript production when paired with single-UTR gene promoters. However, when combined with multi-UTR gene promoters, they change 3'UTR isoform expression by increasing 3' end processing activity of polyadenylation sites. Processing activity of polyadenylation sites is affected by transcription factors, including NF-κB and MYC, transcription elongation factors, chromatin remodelers, and histone acetyltransferases. As endogenous cell type-specific enhancers are associated with genes that increase their short 3'UTRs in a cell type-specific manner, our data suggest that transcriptional enhancers integrate cellular signals to regulate cell type- and condition-specific 3'UTR isoform expression.
Affiliation(s)
- Buki Kwon
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Mervin M Fansler
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Graduate College, New York, NY, 10021, USA
| | - Neil D Patel
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Jihye Lee
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Weirui Ma
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - Christine Mayr
- Cancer Biology and Genetics Program, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA.
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Graduate College, New York, NY, 10021, USA.
| |
|
31
|
Bilodeau DY, Sheridan RM, Balan B, Jex AR, Rissland OS. Precise gene models using long-read sequencing reveal a unique poly(A) signal in Giardia lamblia. RNA (New York, N.Y.) 2022; 28:668-682. [PMID: 35110372 PMCID: PMC9014877 DOI: 10.1261/rna.078793.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/12/2021] [Accepted: 01/17/2022] [Indexed: 06/14/2023]
Abstract
During pre-mRNA processing, the poly(A) signal is recognized by a protein complex that ensures precise cleavage and polyadenylation of the nascent transcript. The location of this cleavage event establishes the length and sequence of the 3' UTR of an mRNA, thus determining much of its post-transcriptional fate. Using long-read sequencing, we characterize the polyadenylation signal and related sequences surrounding Giardia lamblia cleavage sites for over 2600 genes. We find that G. lamblia uses an AGURAA poly(A) signal, which differs from the mammalian AAUAAA. We also describe how G. lamblia lacks common auxiliary elements found in other eukaryotes, along with the proteins that recognize them. Further, we identify 133 genes with evidence of alternative polyadenylation. These results suggest that despite pared-down cleavage and polyadenylation machinery, 3' end formation still appears to be an important regulatory step for gene expression in G. lamblia.
Affiliation(s)
- Danielle Y Bilodeau
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
- RNA Bioscience Initiative, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
| | - Ryan M Sheridan
- RNA Bioscience Initiative, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
| | - Balu Balan
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Melbourne, VIC 3052, Australia
| | - Aaron R Jex
- Population Health and Immunity Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Melbourne, VIC 3052, Australia
- Faculty of Veterinary and Agricultural Sciences, The University of Melbourne, Parkville, VIC 3052, Australia
| | - Olivia S Rissland
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
- RNA Bioscience Initiative, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
| |
|
32
|
Leveraging omic features with F3UTER enables identification of unannotated 3'UTRs for synaptic genes. Nat Commun 2022; 13:2270. [PMID: 35477703 PMCID: PMC9046390 DOI: 10.1038/s41467-022-30017-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 02/24/2021] [Accepted: 03/18/2022] [Indexed: 11/08/2022] Open
Abstract
There is growing evidence for the importance of 3' untranslated region (3'UTR) dependent regulatory processes. However, our current human 3'UTR catalogue is incomplete. Here, we develop a machine learning-based framework, leveraging both genomic and tissue-specific transcriptomic features to predict previously unannotated 3'UTRs. We identify unannotated 3'UTRs associated with 1,563 genes across 39 human tissues, with the greatest abundance found in the brain. These unannotated 3'UTRs are significantly enriched for RNA binding protein (RBP) motifs and exhibit high human lineage-specificity. We find that brain-specific unannotated 3'UTRs are enriched for the binding motifs of important neuronal RBPs such as TARDBP and RBFOX1, and their associated genes are involved in synaptic function. Our data is shared through an online resource F3UTER (https://astx.shinyapps.io/F3UTER/). Overall, our data improves 3'UTR annotation and provides additional insights into the mRNA-RBP interactome in the human brain, with implications for our understanding of neurological and neurodevelopmental diseases.
|
33
|
Pereira-Castro I, Garcia BC, Curinha A, Neves-Costa A, Conde-Sousa E, Moita LF, Moreira A. MCL1 alternative polyadenylation is essential for cell survival and mitochondria morphology. Cell Mol Life Sci 2022; 79:164. [PMID: 35229202 PMCID: PMC11072748 DOI: 10.1007/s00018-022-04172-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Received: 08/17/2021] [Revised: 01/07/2022] [Accepted: 01/27/2022] [Indexed: 02/02/2023]
Abstract
Alternative polyadenylation in the 3' UTR (3' UTR-APA) is a mode of gene expression regulation, fundamental for mRNA stability, translation and localization. In the immune system, it was shown that upon T cell activation, there is an increase in the relative expression of mRNA isoforms with short 3' UTRs resulting from 3' UTR-APA. However, the functional significance of 3' UTR-APA remains largely unknown. Here, we studied the physiological function of 3' UTR-APA in the regulation of Myeloid Cell Leukemia 1 (MCL1), an anti-apoptotic member of the Bcl-2 family essential for T cell survival. We found that T cells produce two MCL1 mRNA isoforms (pA1 and pA2) by 3' UTR-APA. We show that upon T cell activation, there is an increase in both the shorter pA1 mRNA isoform and MCL1 protein levels. Moreover, the less efficiently translated pA2 isoform is downregulated by miR-17, which is also more expressed upon T cell activation. Therefore, by increasing the expression of the more efficiently translated pA1 mRNA isoform, which escapes regulation by miR-17, 3' UTR-APA fine tunes MCL1 protein levels, critical for activated T cells' survival. Furthermore, using CRISPR/Cas9-edited cells, we show that depletion of either pA1 or pA2 mRNA isoforms causes severe defects in mitochondria morphology, increases apoptosis and impacts cell proliferation. Collectively, our results show that MCL1 alternative polyadenylation has a key role in the regulation of MCL1 protein levels upon T cell activation and reveal an essential function for MCL1 3' UTR-APA in cell viability and mitochondria dynamics.
Affiliation(s)
- Isabel Pereira-Castro
- Gene Regulation, i3S, Instituto de Investigação E Inovação Em Saúde, Universidade Do Porto, Porto, Portugal.
- Gene Regulation, IBMC, Instituto de Biologia Molecular E Celular, Universidade Do Porto, Porto, Portugal.
| | - Beatriz C Garcia
- Gene Regulation, i3S, Instituto de Investigação E Inovação Em Saúde, Universidade Do Porto, Porto, Portugal
- Gene Regulation, IBMC, Instituto de Biologia Molecular E Celular, Universidade Do Porto, Porto, Portugal
| | - Ana Curinha
- Gene Regulation, IBMC, Instituto de Biologia Molecular E Celular, Universidade Do Porto, Porto, Portugal
- Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | - Eduardo Conde-Sousa
- i3S, Instituto de Investigação E Inovação Em Saúde, Universidade Do Porto, Porto, Portugal
- INEB, Instituto de Engenharia Biomédica, Universidade Do Porto, Porto, Portugal
| | - Luís F Moita
- Instituto Gulbenkian de Ciência (IGC), Oeiras, Portugal
| | - Alexandra Moreira
- Gene Regulation, i3S, Instituto de Investigação E Inovação Em Saúde, Universidade Do Porto, Porto, Portugal.
- Gene Regulation, IBMC, Instituto de Biologia Molecular E Celular, Universidade Do Porto, Porto, Portugal.
- ICBAS, Instituto de Ciências Biomédicas Abel Salazar, Universidade Do Porto, Porto, Portugal.
| |
|
34
|
Jankovic B, Gojobori T. From shallow to deep: some lessons learned from application of machine learning for recognition of functional genomic elements in human genome. Hum Genomics 2022; 16:7. [PMID: 35180894 PMCID: PMC8855580 DOI: 10.1186/s40246-022-00376-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/26/2021] [Accepted: 01/02/2022] [Indexed: 11/25/2022] Open
Abstract
Identification of genomic signals as indicators for functional genomic elements is one of the areas that received early and widespread application of machine learning methods. With time, the methods applied grew in variety and generally exhibited a tendency to improve their ability to identify some major genomic and transcriptomic signals. The evolution of machine learning in genomics followed a similar path to applications of machine learning in other fields. These were impacted in a major way by three dominant developments, namely an enormous increase in availability and quality of data, a significant increase in computational power available to machine learning applications, and finally, new machine learning paradigms, of which deep learning is the most well-known example. It is not easy in general to distinguish factors leading to improvements in results of applications of machine learning. This is even more so in the field of genomics, where the advent of next-generation sequencing and the increased ability to perform functional analysis of raw data have had a major effect on the applicability of machine learning in OMICS fields. In this paper, we survey the results from a subset of published work on the application of machine learning to the recognition of genomic signals and regions in the human genome and summarize some lessons learnt from this endeavor. There is no doubt that significant progress has been made both in terms of accuracy and reliability of models. Questions remain, however, as to whether the progress has been sufficient and what these developments bring to the field of genomics in general and human genomics in particular. Improving usability, interpretability and accuracy of models remains an important open challenge for current and future research in application of machine learning and more generally of artificial intelligence methods in genomics.
Affiliation(s)
- Boris Jankovic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Takashi Gojobori
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Division of Biological and Environmental Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| |
|
35
|
RBBP6 activates the pre-mRNA 3' end processing machinery in humans. Genes Dev 2022; 36:210-224. [PMID: 35177536 PMCID: PMC8887125 DOI: 10.1101/gad.349223.121] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Received: 11/18/2021] [Accepted: 02/01/2022] [Indexed: 11/25/2022]
Abstract
In this study, Boreikaite et al. reconstituted specific and efficient 3′ endonuclease activity of human CPSF with purified proteins. 3′ end processing of most human mRNAs is carried out by the cleavage and polyadenylation specificity factor (CPSF; CPF in yeast). Endonucleolytic cleavage of the nascent pre-mRNA defines the 3′ end of the mature transcript, which is important for mRNA localization, translation, and stability. Cleavage must therefore be tightly regulated. Here, we reconstituted specific and efficient 3′ endonuclease activity of human CPSF with purified proteins. This required the seven-subunit CPSF as well as three additional protein factors: cleavage stimulatory factor (CStF), cleavage factor IIm (CFIIm), and, importantly, the multidomain protein RBBP6. Unlike its yeast homolog Mpe1, which is a stable subunit of CPF, RBBP6 does not copurify with CPSF and is recruited in an RNA-dependent manner. Sequence and mutational analyses suggest that RBBP6 interacts with the WDR33 and CPSF73 subunits of CPSF. Thus, it is likely that the role of RBBP6 is conserved from yeast to humans. Overall, our data are consistent with CPSF endonuclease activation and site-specific pre-mRNA cleavage being highly controlled to maintain fidelity in mRNA processing.
|
36
|
Masoumzadeh E, Grozdanov PN, Jetly A, MacDonald CC, Latham MP. Electrostatic Interactions between CSTF2 and pre-mRNA Drive Cleavage and Polyadenylation. Biophys J 2022; 121:607-619. [PMID: 35090899 PMCID: PMC8873925 DOI: 10.1016/j.bpj.2022.01.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/13/2021] [Revised: 12/20/2021] [Accepted: 01/07/2022] [Indexed: 11/25/2022] Open
Abstract
Nascent pre-mRNA 3'-end cleavage and polyadenylation (C/P) involves numerous proteins that recognize multiple RNA elements. Human CSTF2 binds to a downstream U- or G/U-rich sequence through its RNA recognition motif (RRM) regulating C/P. We previously reported the only known disease-related CSTF2 RRM mutant (CSTF2D50A) and showed that it changed the on-rate of RNA binding, leading to alternative polyadenylation in brains of mice carrying the same mutation. In this study, we further investigated the role of electrostatic interactions in the thermodynamics and kinetics of RNA binding for the CSTF2 RRM and the downstream consequences for regulation of C/P. By combining mutagenesis with NMR spectroscopy and biophysical assays, we confirmed that electrostatic attraction is the dominant factor in RRM binding to a naturally occurring U-rich RNA sequence. Moreover, we demonstrate that RNA binding is accompanied by an enthalpy-entropy compensation mechanism that is supported by changes in pico-to-nanosecond timescale RRM protein dynamics. We suggest that the dynamic binding of the RRM to U-rich RNA supports the diversity of sequences it encounters in the nucleus. Lastly, in vivo C/P assays demonstrate a competition between fast, high affinity RNA binding and efficient, correct C/P. These results highlight the importance of the surface charge of the RRM in RNA binding and the balance between nascent mRNA binding and C/P in vivo.
|
37
|
Huang CK, Lin WD, Wu SH. An improved repertoire of splicing variants and their potential roles in Arabidopsis photomorphogenic development. Genome Biol 2022; 23:50. [PMID: 35139889 PMCID: PMC8827149 DOI: 10.1186/s13059-022-02620-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 11/07/2020] [Accepted: 01/25/2022] [Indexed: 01/03/2023] Open
Abstract
Background: Light switches on the photomorphogenic development of young plant seedlings, allowing young seedlings to acquire photosynthetic capacities and gain survival fitness. Light regulates gene expression at all levels of the central dogma, including alternative splicing (AS) during the photomorphogenic development. However, accurate determination of full-length (FL) splicing variants has been greatly hampered by short-read RNA sequencing technologies. Results: In this study, we adopt PacBio isoform sequencing (Iso-seq) to overcome the limitation of the short-read RNA-seq technologies. Normalized cDNA libraries used for Iso-seq allow for comprehensive and effective identification of FL AS variants. Our analyses reveal more than 30,000 splicing variant models from approximately 16,500 gene loci and additionally identify approximately 700 previously unannotated genes. Among the variants, approximately 12,000 represent new gene models. Intron retention (IR) is the most frequently observed form of variants, and many IR-containing AS variants show evidence of engagement in translation. Our study reveals the formation of heterodimers of transcription factors composed of annotated and IR-containing AS variants. Moreover, transgenic plants overexpressing the IR forms of two B-BOX DOMAIN PROTEINs exhibit light-hypersensitive phenotypes, suggesting their regulatory roles in modulating optimal light responses. Conclusions: This study provides an accurate and comprehensive portrait of full-length transcript isoforms and experimentally confirms the presence of de novo synthesized AS variants that impose regulatory functions in photomorphogenic development in Arabidopsis. Supplementary Information: The online version contains supplementary material available at 10.1186/s13059-022-02620-2.
Affiliation(s)
- Chun-Kai Huang
- Institute of Plant and Microbial Biology, Academia Sinica, 128, Sec. 2, Academia Rd., Taipei, 11529, Taiwan
| | - Wen-Dar Lin
- The Bioinformatics Core Lab, Institute of Plant and Microbial Biology, Academia Sinica, Taipei, 11529, Taiwan
| | - Shu-Hsing Wu
- Institute of Plant and Microbial Biology, Academia Sinica, 128, Sec. 2, Academia Rd., Taipei, 11529, Taiwan
| |
|
38
|
Raman P, Rominger MC, Young JM, Molaro A, Tsukiyama T, Malik HS. Novel classes and evolutionary turnover of histone H2B variants in the mammalian germline. Mol Biol Evol 2022; 39:6517784. [PMID: 35099534 PMCID: PMC8857922 DOI: 10.1093/molbev/msac019] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Indexed: 11/14/2022] Open
Abstract
Histones and their posttranslational modifications facilitate diverse chromatin functions in eukaryotes. Core histones (H2A, H2B, H3, and H4) package genomes after DNA replication. In contrast, variant histones promote specialized chromatin functions, including DNA repair, genome stability, and epigenetic inheritance. Previous studies have identified only a few H2B variants in animals; their roles and evolutionary origins remain largely unknown. Here, using phylogenomic analyses, we reveal the presence of five H2B variants broadly present in mammalian genomes. Three of these variants have been previously described: H2B.1, H2B.L (also called subH2B), and H2B.W. In addition, we identify and describe two new variants: H2B.K and H2B.N. Four of these variants originated in mammals, whereas H2B.K arose prior to the last common ancestor of bony vertebrates. We find that though H2B variants are subject to high gene turnover, most are broadly retained in mammals, including humans. Despite an overall signature of purifying selection, H2B variants evolve more rapidly than core H2B with considerable divergence in sequence and length. All five H2B variants are expressed in the germline. H2B.K and H2B.N are predominantly expressed in oocytes, an atypical expression site for mammalian histone variants. Our findings suggest that H2B variants likely encode potentially redundant but vital functions via unusual chromatin packaging or nonchromatin functions in mammalian germline cells. Our discovery of novel histone variants highlights the advantages of comprehensive phylogenomic analyses and provides unique opportunities to study how innovations in chromatin function evolve.
Affiliation(s)
- Pravrutha Raman
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
| | - Mary C Rominger
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
- Whitman College, Walla Walla, Washington, 99362, USA
| | - Janet M Young
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
| | - Antoine Molaro
- Genetics, Reproduction and Development (GReD) Institute, CNRS UMR 6293, INSERM U1103, Université Clermont Auvergne, Clermont-Ferrand, France
| | - Toshio Tsukiyama
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
| | - Harmit S Malik
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington, 98109, USA
| |
|
39
|
Zhang Y, Song J, Zhang M, Deng Z. Analysis Polyadenylation Signal Usage in Sus scrofa. Animals (Basel) 2022; 12:ani12020194. [PMID: 35049816 PMCID: PMC8773104 DOI: 10.3390/ani12020194] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 12/07/2021] [Revised: 01/01/2022] [Accepted: 01/10/2022] [Indexed: 12/12/2022] Open
Abstract
RNA polyadenylation is an important step in messenger RNA (mRNA) maturation, and its first step is recognition of the polyadenylation signal (PAS). PAS type and distribution are key determinants of post-transcriptional mRNA modification and gene expression. However, little is known about PAS usage and alternative polyadenylation (APA) regulation in livestock species. Recently, sequencing technology has generated large amounts of data revealing variation in poly(A) signals and APA regulation in Sus scrofa. We identified 62,491 polyadenylation signals in Sus scrofa using expressed sequence tag (EST) sequences combined with RNA-seq analysis. The composition and usage frequency of polyadenylation signals in Sus scrofa are similar to those in human and mouse. The most highly conserved polyadenylation signals are AAUAAA and AUUAAA, used for over 63.35% of genes. In addition, we analyzed the U/GU-rich downstream sequence element (DSE), located downstream of the cleavage site. Our results indicate that APA regulation occurs widely in Sus scrofa, as in other organisms. These results are useful for the accurate annotation of RNA 3' ends in Sus scrofa, and the analysis of polyadenylation signal usage may give new insights into the mechanisms of transcriptional regulation.
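As a rough illustration of what PAS detection involves, the sketch below scans the region upstream of a cleavage site for the two canonical hexamers named in the abstract. The 10-35 nt upstream window is a commonly cited range that is assumed here, not a value taken from the study; the function name `find_pas` and the toy sequence are hypothetical.

```python
# Canonical hexamers named in the abstract (RNA alphabet).
CANONICAL_PAS = ("AAUAAA", "AUUAAA")


def find_pas(seq, cleavage_site, window=(10, 35)):
    """Return (hexamer, distance) pairs for canonical PAS hexamers whose
    start lies within `window` nt upstream of `cleavage_site`.

    `cleavage_site` is a 0-based index into `seq`. The window bounds are
    an illustrative assumption, not parameters from the study.
    """
    rna = seq.upper().replace("T", "U")  # accept DNA or RNA input
    near, far = window
    start = max(0, cleavage_site - far)
    stop = cleavage_site - near
    hits = []
    for i in range(start, stop + 1):
        hexamer = rna[i:i + 6]
        if hexamer in CANONICAL_PAS:
            hits.append((hexamer, cleavage_site - i))
    return hits


# Toy sequence (hypothetical): one AATAAA hexamer 26 nt upstream of position 36.
toy = "GCGCGCGCGC" + "AATAAA" + "C" * 20 + "GTGTGTGT"
print(find_pas(toy, cleavage_site=36))
```

A genome-wide survey such as the one described would additionally need cleavage sites inferred from EST or RNA-seq evidence and a ranked list of non-canonical hexamer variants, neither of which is modeled here.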
Affiliation(s)
- Yuting Zhang
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China
| | - Jingwen Song
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China
| | - Min Zhang
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China
| | - Zhongyuan Deng
- School of Agricultural Sciences, Zhengzhou University, Zhengzhou 450001, China
- School of Life Sciences, Zhengzhou University, Zhengzhou 450001, China
- Correspondence:
| |
|
40
|
Tian S, Zhang B, He Y, Sun Z, Li J, Li Y, Yi H, Zhao Y, Zou X, Li Y, Cui H, Fang L, Gao X, Hu Y, Chen W. OUP accepted manuscript. Nucleic Acids Res 2022; 50:e26. [PMID: 35191504 PMCID: PMC8934656 DOI: 10.1093/nar/gkac108] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 11/30/2020] [Revised: 02/01/2022] [Accepted: 02/19/2022] [Indexed: 11/14/2022] Open
Affiliation(s)
| | | | - Yuhao He
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Zhiyuan Sun
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Jun Li
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
- Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen 518055, China
| | - Yisheng Li
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Hongyang Yi
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Yan Zhao
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Xudong Zou
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Yunfei Li
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
| | - Huanhuan Cui
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
- Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen 518055, China
| | - Liang Fang
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
- Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen 518055, China
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
| | - Yuhui Hu
- Shenzhen Key Laboratory of Gene Regulation and Systems Biology, Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen 518055, China
- Academy for Advanced Interdisciplinary Studies, Southern University of Science and Technology, Shenzhen 518055, China
| | - Wei Chen
- To whom correspondence should be addressed. Tel: +86 755 88018449;
| |
|
41
|
Alternative polyadenylation: An untapped source for prostate cancer biomarkers and therapeutic targets? Asian J Urol 2021; 8:407-415. [PMID: 34765448 PMCID: PMC8566364 DOI: 10.1016/j.ajur.2021.05.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2020] [Revised: 03/20/2021] [Accepted: 05/05/2021] [Indexed: 11/25/2022] Open
Abstract
Objective: To review alternative polyadenylation (APA) as a mechanism of gene regulation and consider potential roles for APA in prostate cancer (PCa) biology and treatment. Methods: An extensive review of the mRNA polyadenylation, APA, and PCa literature was performed. This review article introduces APA and its association with human disease, outlines the mechanisms and components of APA, reviews APA in cancer biology, and considers whether APA may contribute to PCa progression and/or produce novel biomarkers and therapeutic targets for PCa. Results: Eukaryotic mRNA 3′-end cleavage and polyadenylation play a critical role in gene expression. Most human genes encode more than one polyadenylation signal and produce more than one transcript isoform through APA. Polyadenylation can occur throughout the gene body to generate transcripts with differing 3′-termini and coding sequences. Differences in 3′-untranslated region length can modify post-transcriptional gene regulation by microRNAs and RNA-binding proteins, and alter mRNA stability, translation efficiency, and subcellular localization. Distinctive APA patterns are associated with human diseases, tissue origins, and changes in cellular proliferation rate and differentiation state. APA events may therefore generate unique mRNA biomarkers or therapeutic targets in certain cancer types or phenotypic states. Conclusions: The full extent of cancer-associated and tissue-specific APA events has yet to be defined, and the mechanisms and functional consequences of APA in cancer remain incompletely understood. There is evidence that APA is active in PCa, and that it may be an untapped resource for PCa biomarkers or therapeutic targets.
|
42
|
Dasilva LF, Blumenthal E, Beckedorff F, Cingaram PR, Gomes Dos Santos H, Edupuganti RR, Zhang A, Dokaneheifard S, Aoi Y, Yue J, Kirstein N, Tayari MM, Shilatifard A, Shiekhattar R. Integrator enforces the fidelity of transcriptional termination at protein-coding genes. Sci Adv 2021; 7:eabe3393. [PMID: 34730992 PMCID: PMC8565846 DOI: 10.1126/sciadv.abe3393] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 08/15/2020] [Accepted: 09/15/2021] [Indexed: 06/12/2023]
Abstract
Integrator regulates the 3′-end processing and termination of multiple classes of noncoding RNAs. Depletion of INTS11, the catalytic subunit of Integrator, or ectopic expression of its catalytic dead enzyme impairs the 3′-end processing and termination of a set of protein-coding transcripts termed Integrator-regulated termination (IRT) genes. This defect is manifested by increased RNA polymerase II (RNAPII) readthrough and occupancy of serine-2 phosphorylated RNAPII, de novo trimethylation of lysine-36 on histone H3, and a compensatory elevation of the cleavage and polyadenylation (CPA) complex beyond the canonical polyadenylation sites. 3′ RNA sequencing reveals that proximal polyadenylation site usage relies on the endonuclease activity of INTS11. The DNA sequence encompassing the transcription end sites of IRT genes features downstream polyadenylation motifs and an enrichment of GC content that permits the formation of secondary structures within the 3′UTR. Together, this study identifies a subset of protein-coding transcripts whose 3′ end processing requires the Integrator complex.
Affiliation(s)
- Lucas Ferreira Dasilva
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Ezra Blumenthal
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
- Medical Scientist Training Program and Graduate Program in Cancer Biology, University of Miami Miller School of Medicine, Miami, FL 33136, USA
| | - Felipe Beckedorff
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Pradeep Reddy Cingaram
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Helena Gomes Dos Santos
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Raghu Ram Edupuganti
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Anda Zhang
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Sadat Dokaneheifard
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Yuki Aoi
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Jingyin Yue
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Nina Kirstein
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Mina Masoumeh Tayari
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| | - Ali Shilatifard
- Department of Biochemistry and Molecular Genetics, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
- Robert H. Lurie Comprehensive Cancer Center, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Ramin Shiekhattar
- Department of Human Genetics, University of Miami Miller School of Medicine, Sylvester Comprehensive Cancer Center, 1501 NW 10th Avenue, Miami, FL 33136, USA
| |
|
43
|
Lackey L, Coria A, Ghosh AJ, Grayeski P, Hatfield A, Shankar V, Platig J, Xu Z, Ramos SBV, Silverman EK, Ortega VE, Cho MH, Hersh CP, Hobbs BD, Castaldi P, Laederach A. Alternative poly-adenylation modulates α1-antitrypsin expression in chronic obstructive pulmonary disease. PLoS Genet 2021; 17:e1009912. [PMID: 34784346 PMCID: PMC8631626 DOI: 10.1371/journal.pgen.1009912] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/28/2021] [Revised: 11/30/2021] [Accepted: 10/25/2021] [Indexed: 01/07/2023] Open
Abstract
α1-anti-trypsin (A1AT), encoded by SERPINA1, is a neutrophil elastase inhibitor that controls the inflammatory response in the lung. Severe A1AT deficiency increases risk for Chronic Obstructive Pulmonary Disease (COPD); however, the role of A1AT in COPD in non-deficient individuals is not well known. We identify a 2.1-fold increase (p = 2.5 × 10⁻⁶) in the use of a distal poly-adenylation site in primary lung tissue RNA-seq in 82 COPD cases when compared to 64 controls and replicate this in an independent study of 376 COPD cases and 267 controls. This alternative polyadenylation event involves two sites, a proximal and a distal site, 61 and 1683 nucleotides downstream of the A1AT stop codon. To characterize this event, we measured the distal ratio in human primary tissue short-read RNA-seq data and corroborated our results with long-read RNA-seq data. Integrating these results with 3' end RNA-seq and nanoluciferase reporter assay experiments, we show that use of the distal site yields mRNA transcripts with over 50-fold decreased translation efficiency and A1AT expression. Using enhanced CrossLinking and ImmunoPrecipitation (eCLIP), we identified seven RNA-binding proteins with one or more binding sites in the SERPINA1 3' UTR. We combined these data with measurements of the distal ratio in shRNA knockdown experiments, nuclear and cytoplasmic fractionation, and chemical RNA structure probing. We identify Quaking Homolog (QKI) as a modulator of SERPINA1 mRNA translation and confirm the role of QKI in SERPINA1 translation with luciferase reporter assays. Analysis of single-cell RNA-seq showed differences in the distribution of the SERPINA1 distal ratio among hepatocytes, macrophages, αβ T cells and plasma cells in the liver. In the lung, alveolar type 1 and type 2 cells, dendritic cells and macrophages also vary in their distal ratio. Our work reveals a complex post-transcriptional mechanism that regulates alternative polyadenylation and A1AT expression in COPD.
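The abstract does not spell out how the distal ratio is computed. A minimal sketch, assuming it is the fraction of reads supporting the distal site among reads at both sites (the study's exact definition may differ), is given below. All read counts are made-up illustrative values, and the function names are hypothetical.

```python
def distal_ratio(proximal_reads, distal_reads):
    """Fraction of site-supporting reads that map to the distal site.

    Assumed definition: distal / (proximal + distal). Returns None when
    no reads support either site, since the ratio is then undefined.
    """
    total = proximal_reads + distal_reads
    if total == 0:
        return None
    return distal_reads / total


def mean_fold_change(case_counts, control_counts):
    """Ratio of mean distal ratios (cases over controls).

    Each argument is a list of (proximal_reads, distal_reads) tuples,
    one per sample; samples without coverage are skipped.
    """
    def mean_ratio(counts):
        ratios = [r for r in (distal_ratio(p, d) for p, d in counts) if r is not None]
        return sum(ratios) / len(ratios)

    return mean_ratio(case_counts) / mean_ratio(control_counts)


# Hypothetical per-sample (proximal, distal) read counts.
cases = [(60, 40), (50, 50)]
controls = [(80, 20), (70, 30)]
print(mean_fold_change(cases, controls))
```

A real comparison would also need a statistical test across samples and normalization for 3' UTR coverage bias, which this sketch omits.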
Affiliation(s)
- Lela Lackey
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - Aaztli Coria
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Auyon J. Ghosh
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Phil Grayeski
- Curriculum in Genetics and Molecular Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Abigail Hatfield
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - Vijay Shankar
- Department of Genetics and Biochemistry, Center for Human Genetics, Clemson University, Greenwood, South Carolina, United States of America
| | - John Platig
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Zhonghui Xu
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Silvia B. V. Ramos
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Edwin K. Silverman
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Victor E. Ortega
- Department of Internal Medicine, Division of Respiratory Medicine, Center for Individualized Medicine, Mayo Clinic, Scottsdale, Arizona, United States of America
| | - Michael H. Cho
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Craig P. Hersh
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Brian D. Hobbs
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Peter Castaldi
- Channing Division of Network Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
- Division of Internal Medicine and Primary Care, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Alain Laederach
- Department of Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| |
|
44
|
Shah A, Mittleman BE, Gilad Y, Li YI. Benchmarking sequencing methods and tools that facilitate the study of alternative polyadenylation. Genome Biol 2021; 22:291. [PMID: 34649612 PMCID: PMC8518154 DOI: 10.1186/s13059-021-02502-z] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 02/05/2021] [Accepted: 09/16/2021] [Indexed: 12/02/2022] Open
Abstract
Background: Alternative cleavage and polyadenylation (APA), an RNA processing event, occurs in over 70% of human protein-coding genes. APA results in mRNA transcripts with distinct 3' ends. Most APA occurs within 3' UTRs, which harbor regulatory elements that can impact mRNA stability, translation, and localization. Results: APA can be profiled using a number of established computational tools that infer polyadenylation sites from standard, short-read RNA-seq datasets. Here, we benchmarked a number of such tools (TAPAS, QAPA, DaPars2, GETUTR, and APATrap) against 3'-Seq, a specialized RNA-seq protocol that enriches for reads at the 3' ends of genes, and Iso-Seq, a Pacific Biosciences (PacBio) single-molecule full-length RNA-seq method, in their ability to identify polyadenylation sites and quantify polyadenylation site usage. We demonstrate that 3'-Seq and Iso-Seq are able to identify and quantify the usage of polyadenylation sites more reliably than computational tools that take short-read RNA-seq as input. However, we find that running one such tool, QAPA, with a set of polyadenylation site annotations derived from small quantities of 3'-Seq or Iso-Seq can reliably quantify variation in APA across conditions, such as across genotypes, as demonstrated by the successful mapping of alternative polyadenylation quantitative trait loci (apaQTL). Conclusions: We envisage that our analyses will shed light on the advantages of studying APA with more specialized sequencing protocols, such as 3'-Seq or Iso-Seq, and the limitations of studying APA with short-read RNA-seq. We provide a computational pipeline to aid in the identification of polyadenylation sites and quantification of polyadenylation site usage using Iso-Seq data as input.
Affiliation(s)
- Ankeeta Shah
- Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA
| | - Briana E Mittleman
- Genetics, Genomics, and Systems Biology, University of Chicago, Chicago, IL, USA
| | - Yoav Gilad
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Yang I Li
- Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL, USA
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
|
45
|
Mohanan NK, Shaji F, Koshre GR, Laishram RS. Alternative polyadenylation: An enigma of transcript length variation in health and disease. Wiley Interdiscip Rev RNA 2021; 13:e1692. [PMID: 34581021 DOI: 10.1002/wrna.1692] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 04/13/2021] [Revised: 06/16/2021] [Accepted: 08/24/2021] [Indexed: 12/19/2022]
Abstract
Alternative polyadenylation (APA) is a molecular mechanism during pre-mRNA processing that involves usage of more than one polyadenylation site (PA-site), generating transcripts of varying length from a single gene. The location of a PA-site affects the transcript length and coding potential of an mRNA, contributing to both mRNA and protein diversification. This variation in transcript length affects mRNA stability and translation, mRNA subcellular and tissue localization, and protein function. APA is now considered an important regulatory mechanism in the pathophysiology of human diseases. An important consequence of changes in the length of the 3'-untranslated region (UTR) from disease-induced APA is altered protein expression. Yet, the relationship between 3'-UTR length and protein expression remains a paradox in a majority of diseases. Here, we review the occurrence of APA, the mechanism of PA-site selection, and the consequences of transcript length variation in different diseases. Emerging evidence reveals coordinated involvement of core RNA processing factors, including poly(A) polymerases, in PA-site selection in disease-associated APA. Targeting such APA regulators will be therapeutically significant in combating drug resistance in cancer and other complex diseases. This article is categorized under: RNA Processing > 3' End Processing; RNA in Disease and Development > RNA in Disease; Translation > Regulation.
Affiliation(s)
- Neeraja K Mohanan
- Cardiovascular and Diabetes Biology Group, Rajiv Gandhi Centre for Biotechnology, Trivandrum, India
- Manipal Academy of Higher Education, Manipal, India
| | - Feba Shaji
- Cardiovascular and Diabetes Biology Group, Rajiv Gandhi Centre for Biotechnology, Trivandrum, India
- Regional Centre for Biotechnology, Faridabad, India
| | - Ganesh R Koshre
- Cardiovascular and Diabetes Biology Group, Rajiv Gandhi Centre for Biotechnology, Trivandrum, India
- Manipal Academy of Higher Education, Manipal, India
| | - Rakesh S Laishram
- Cardiovascular and Diabetes Biology Group, Rajiv Gandhi Centre for Biotechnology, Trivandrum, India
| |
|
46
|
Wang P, Zhou Y, Richards AM. Effective tools for RNA-derived therapeutics: siRNA interference or miRNA mimicry. Theranostics 2021; 11:8771-8796. [PMID: 34522211 PMCID: PMC8419061 DOI: 10.7150/thno.62642] [Citation(s) in RCA: 41] [Impact Index Per Article: 13.7] [Received: 05/12/2021] [Accepted: 07/30/2021] [Indexed: 12/18/2022] Open
Abstract
The approval of the first small interfering RNA (siRNA) drug, Patisiran, by the FDA in 2018 marked a new era of RNA interference (RNAi) therapeutics. MicroRNAs (miRNAs), important post-transcriptional gene regulators, are also the subject of both basic research and clinical trials. Both siRNAs and miRNA mimics are ~21-nucleotide RNA duplexes that induce mRNA silencing. Given the strong performance of siRNA, researchers ask whether miRNA mimics are unnecessary or whether established siRNA technology can pave the way for the emergence of miRNA mimic drugs. Through a comprehensive comparison of siRNA and miRNA, we focus on (1) the common features and lessons learnt from the success of siRNAs; (2) the unique characteristics of miRNA that potentially offer additional therapeutic advantages and opportunities; (3) key areas of ongoing research that will contribute to clinical application of miRNA mimics. In conclusion, miRNA mimics have unique properties and advantages which cannot be fully matched by siRNA in clinical applications. MiRNAs are endogenous molecules, and the gene silencing effects of miRNA mimics can be regulated or buffered to ameliorate or eliminate off-target effects. An in-depth understanding of the differences between siRNA and miRNA mimics will facilitate the development of miRNA mimic drugs.
|
47
|
Analysis of SINE Families B2, Dip, and Ves with Special Reference to Polyadenylation Signals and Transcription Terminators. Int J Mol Sci 2021; 22:ijms22189897. [PMID: 34576060 PMCID: PMC8466645 DOI: 10.3390/ijms22189897] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/16/2021] [Revised: 09/05/2021] [Accepted: 09/06/2021] [Indexed: 01/09/2023] Open
Abstract
Short Interspersed Elements (SINEs) are eukaryotic non-autonomous retrotransposons transcribed by RNA polymerase III (pol III). The 3′-terminus of many mammalian SINEs has a polyadenylation signal (AATAAA), a pol III transcription terminator, and an A-rich tail. The RNAs of such SINEs can be polyadenylated, which is unique for pol III transcripts. Here, the B2 (mice and related rodents), Dip (jerboas), and Ves (vespertilionid bats) SINE families were thoroughly studied. They were divided into subfamilies reliably distinguished by relatively long indels. The age of SINE subfamilies can be estimated, which allows us to reconstruct their evolution. The youngest and most active variants of SINE subfamilies were given special attention. The shortest pol III transcription terminators are TCTTT (B2), TATTT (Ves and Dip), and the rarer TTTT. The last nucleotide of the terminator is often not transcribed; accordingly, the truncated terminator of its descendant becomes nonfunctional. The incidence of complete transcription of the TCTTT terminator is twice as high as that of TTTT, and thus functional terminators are more likely to be preserved in daughter SINE copies. Young copies have long poly(A) tails; however, these gradually shorten over host generations. Unexpectedly, tail shortening below A10 increases the incidence of terminator elongation by Ts, thus restoring its efficiency. This process can be critical for the maintenance of SINE activity in the genome.
|
48
|
Magnusson JP, Rios AR, Wu L, Qi LS. Enhanced Cas12a multi-gene regulation using a CRISPR array separator. eLife 2021; 10:e66406. [PMID: 34499031 PMCID: PMC8478413 DOI: 10.7554/elife.66406] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 01/09/2021] [Accepted: 09/08/2021] [Indexed: 12/26/2022] Open
Abstract
The type V-A Cas12a protein can process its CRISPR array, a feature useful for multiplexed gene editing and regulation. However, CRISPR arrays often exhibit unpredictable performance due to interference between multiple guide RNAs (gRNAs). Here, we report that Cas12a array performance is hypersensitive to the GC content of gRNA spacers, as high-GC spacers can impair activity of the downstream gRNA. We analyze naturally occurring CRISPR arrays and observe that natural repeats always contain an AT-rich fragment that separates gRNAs, which we term a CRISPR separator. Inspired by this observation, we design short, AT-rich synthetic separators (synSeparators) that successfully remove the disruptive effects between gRNAs. We further demonstrate enhanced simultaneous activation of seven endogenous genes in human cells using an array containing the synSeparator. These results elucidate a previously underexplored feature of natural CRISPR arrays and demonstrate how nature-inspired engineering solutions can improve multi-gene control in mammalian cells.
Affiliation(s)
- Jens P Magnusson
- Department of Bioengineering, Stanford University, Stanford, United States
| | - Antonio Ray Rios
- Department of Bioengineering, Stanford University, Stanford, United States
| | - Lingling Wu
- Department of Bioengineering, Stanford University, Stanford, United States
| | - Lei S Qi
- Department of Bioengineering, Stanford University, Stanford, United States
- Department of Chemical and Systems Biology, Stanford University, Stanford, United States
- Stanford ChEM-H Institute, Stanford University, Stanford, United States
| |
|
49
|
Adenine base editing of the DUX4 polyadenylation signal for targeted genetic therapy in facioscapulohumeral muscular dystrophy. Mol Ther Nucleic Acids 2021; 25:342-354. [PMID: 34484861 PMCID: PMC8399085 DOI: 10.1016/j.omtn.2021.05.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 01/08/2021] [Accepted: 05/26/2021] [Indexed: 12/26/2022]
Abstract
Facioscapulohumeral muscular dystrophy (FSHD) is caused by chromatin relaxation of the D4Z4 repeat resulting in misexpression of the D4Z4-encoded DUX4 gene in skeletal muscle. One of the key genetic requirements for the stable production of full-length DUX4 mRNA in skeletal muscle is a functional polyadenylation signal (ATTAAA) in exon three of DUX4 that is used in somatic cells. Base editors hold great promise to treat DNA lesions underlying genetic diseases through their ability to carry out specific and rapid nucleotide mutagenesis even in postmitotic cells such as skeletal muscle. In this study, we present a simple and straightforward strategy for mutagenesis of the somatic DUX4 polyadenylation signal by adenine base editing in immortalized myoblasts derived from independent FSHD-affected individuals. We show that mutating this critical cis-regulatory element results in downregulation of DUX4 mRNA and its direct transcriptional target genes. Our findings identify the somatic DUX4 polyadenylation signal as a therapeutic target and represent the first step toward clinical application of the CRISPR-Cas9 base editing platform for FSHD gene therapy.
|
50
|
Liu H, Moore CL. On the Cutting Edge: Regulation and Therapeutic Potential of the mRNA 3' End Nuclease. Trends Biochem Sci 2021; 46:772-784. [PMID: 33941430 PMCID: PMC8364479 DOI: 10.1016/j.tibs.2021.04.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 12/15/2020] [Revised: 03/18/2021] [Accepted: 04/02/2021] [Indexed: 12/24/2022]
Abstract
Cleavage of nascent transcripts is a fundamental process for eukaryotic mRNA maturation and for the production of different mRNA isoforms. In eukaryotes, cleavage of mRNA precursors by the highly conserved endonuclease CPSF73 is critical for mRNA stability, export from the nucleus, and translation. As an essential enzyme in the cell, CPSF73 surprisingly shows promise as a drug target for specific cancers and for protozoan parasites. In this review, we cover our current understanding of CPSF73 in cleavage and polyadenylation, histone pre-mRNA processing, and transcription termination. We discuss the potential of CPSF73 as a target for novel therapeutics and highlight further research into the regulation of CPSF73 that will be critical to understanding its role in cancer and other diseases.
Affiliation(s)
- Huiyun Liu
- Department of Developmental, Molecular, and Chemical Biology, Tufts University School of Medicine, Boston, MA 02111, USA
| | - Claire L Moore
- Department of Developmental, Molecular, and Chemical Biology, Tufts University School of Medicine, Boston, MA 02111, USA
| |
|