1
|
Carrion SA, Michal JJ, Jiang Z. Alternative Transcripts Diversify Genome Function for Phenome Relevance to Health and Diseases. Genes (Basel) 2023; 14:2051. [PMID: 38002994 PMCID: PMC10671453 DOI: 10.3390/genes14112051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/06/2023] [Accepted: 11/07/2023] [Indexed: 11/26/2023] Open
Abstract
Manipulation using alternative exon splicing (AES), alternative transcription start (ATS), and alternative polyadenylation (APA) sites are key to transcript diversity underlying health and disease. All three are pervasive in organisms, present in at least 50% of human protein-coding genes. In fact, ATS and APA site use has the highest impact on protein identity, with their ability to alter which first and last exons are utilized as well as impacting stability and translation efficiency. These RNA variants have been shown to be highly specific, both in tissue type and stage, with demonstrated importance to cell proliferation, differentiation and the transition from fetal to adult cells. While alternative exon splicing has a limited effect on protein identity, its ubiquity highlights the importance of these minor alterations, which can alter other features such as localization. The three processes are also highly interwoven, with overlapping, complementary, and competing factors, RNA polymerase II and its CTD (C-terminal domain) chief among them. Their role in development means dysregulation leads to a wide variety of disorders and cancers, with some forms of disease disproportionately affected by specific mechanisms (AES, ATS, or APA). Challenges associated with the genome-wide profiling of RNA variants and their potential solutions are also discussed in this review.
Collapse
Affiliation(s)
| | | | - Zhihua Jiang
- Department of Animal Sciences and Center for Reproductive Biology, Washington State University, Pullman, WA 99164-7620, USA; (S.A.C.); (J.J.M.)
| |
Collapse
|
2
|
Jonas F, Vidavski M, Benuck E, Barkai N, Yaakov G. Nucleosome retention by histone chaperones and remodelers occludes pervasive DNA-protein binding. Nucleic Acids Res 2023; 51:8496-8513. [PMID: 37493599 PMCID: PMC10484674 DOI: 10.1093/nar/gkad615] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 06/07/2023] [Accepted: 07/11/2023] [Indexed: 07/27/2023] Open
Abstract
DNA packaging within chromatin depends on histone chaperones and remodelers that form and position nucleosomes. Cells express multiple such chromatin regulators with overlapping in-vitro activities. Defining specific in-vivo activities requires monitoring histone dynamics during regulator depletion, which has been technically challenging. We have recently generated histone-exchange sensors in Saccharomyces cerevisiae, which we now use to define the contributions of 15 regulators to histone dynamics genome-wide. While replication-independent exchange in unperturbed cells maps to promoters, regulator depletions primarily affected gene bodies. Depletion of Spt6, Spt16 or Chd1 sharply increased nucleosome replacement sequentially at the beginning, middle or end of highly expressed gene bodies. They further triggered re-localization of chaperones to affected gene body regions, which compensated for nucleosome loss during transcription complex passage, but concurred with extensive TF binding in gene bodies. We provide a unified quantitative screen highlighting regulator roles in retaining nucleosome binding during transcription and preserving genomic packaging.
Collapse
Affiliation(s)
- Felix Jonas
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Matan Vidavski
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Eli Benuck
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Naama Barkai
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Gilad Yaakov
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
3
|
Guzman C, Duttke S, Zhu Y, De Arruda Saldanha C, Downes N, Benner C, Heinz S. Combining TSS-MPRA and sensitive TSS profile dissimilarity scoring to study the sequence determinants of transcription initiation. Nucleic Acids Res 2023; 51:e80. [PMID: 37403796 PMCID: PMC10450201 DOI: 10.1093/nar/gkad562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/13/2023] [Accepted: 06/20/2023] [Indexed: 07/06/2023] Open
Abstract
Cis-regulatory elements (CREs) can be classified by the shapes of their transcription start site (TSS) profiles, which are indicative of distinct regulatory mechanisms. Massively parallel reporter assays (MPRAs) are increasingly being used to study CRE regulatory mechanisms, yet the degree to which MPRAs replicate individual endogenous TSS profiles has not been determined. Here, we present a new low-input MPRA protocol (TSS-MPRA) that enables measuring TSS profiles of episomal reporters as well as after lentiviral reporter chromatinization. To sensitively compare MPRA and endogenous TSS profiles, we developed a novel dissimilarity scoring algorithm (WIP score) that outperforms the frequently used earth mover's distance on experimental data. Using TSS-MPRA and WIP scoring on 500 unique reporter inserts, we found that short (153 bp) MPRA promoter inserts replicate the endogenous TSS patterns of ∼60% of promoters. Lentiviral reporter chromatinization did not improve fidelity of TSS-MPRA initiation patterns, and increasing insert size frequently led to activation of extraneous TSS in the MPRA that are not active in vivo. We discuss the implications of our findings, which highlight important caveats when using MPRAs to study transcription mechanisms. Finally, we illustrate how TSS-MPRA and WIP scoring can provide novel insights into the impact of transcription factor motif mutations and genetic variants on TSS patterns and transcription levels.
Collapse
Affiliation(s)
- Carlos Guzman
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
- Department of Bioengineering, Graduate Program in Bioinformatics & Systems Biology, U.C. San Diego, La Jolla, CA 92093, USA
| | - Sascha Duttke
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Yixin Zhu
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Camila De Arruda Saldanha
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Nicholas L Downes
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Christopher Benner
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Sven Heinz
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| |
Collapse
|
4
|
de Melo RN, de Souza Hassemer G, Steffens J, Junges A, Valduga E. Recent updates to microbial production and recovery of polyhydroxyalkanoates. 3 Biotech 2023; 13:204. [PMID: 37223002 PMCID: PMC10200728 DOI: 10.1007/s13205-023-03633-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Accepted: 05/12/2023] [Indexed: 05/25/2023] Open
Abstract
The increasing use of synthetic polymers and their disposal has raised concern due to their adverse effects on the environment. Thus, other sustainable alternatives to synthetic plastics have been sought, such as polyhydroxyalkanoates (PHAs), which are promising microbial polyesters, mainly due to their compostable nature, biocompatibility, thermostability, and resilience, making this biopolymer acceptable in several applications in the global market. The large-scale production of PHAs by microorganisms is still limited by the high cost of production compared to conventional plastics. This review reports some strategies mentioned in the literature aimed at production and recovery, paving the way for the bio-based economy. For this, some aspects of PHAs are addressed, such as synthesis, production systems, process control using by-products from industries, and advances and challenges in the downstream. The bioplastics properties made them a prime candidate for food, pharmaceutical, and chemical industrial applications. With this paper, it is possible to see that biodegradable polymers are promising materials, mainly for reducing the pollution produced by polymers derived from petroleum.
Collapse
Affiliation(s)
- Rafaela Nery de Melo
- Department of Food and Chemical Engineering, URI-Erechim, Sete de Setembro Av, Erechim, RS 162199709-910 Brazil
| | - Guilherme de Souza Hassemer
- Department of Food and Chemical Engineering, URI-Erechim, Sete de Setembro Av, Erechim, RS 162199709-910 Brazil
| | - Juliana Steffens
- Department of Food and Chemical Engineering, URI-Erechim, Sete de Setembro Av, Erechim, RS 162199709-910 Brazil
| | - Alexander Junges
- Department of Food and Chemical Engineering, URI-Erechim, Sete de Setembro Av, Erechim, RS 162199709-910 Brazil
| | - Eunice Valduga
- Department of Food and Chemical Engineering, URI-Erechim, Sete de Setembro Av, Erechim, RS 162199709-910 Brazil
| |
Collapse
|
5
|
Murray A, Vollmers C, Schmitz RJ. Smar2C2: A Simple and Efficient Protocol for the Identification of Transcription Start Sites. Curr Protoc 2023; 3:e705. [PMID: 36947693 DOI: 10.1002/cpz1.705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]
Abstract
Promoters and the noncoding sequences that drive their function are fundamental aspects of genes that are critical to their regulation. The transcription preinitiation complex binds and assembles on promoters where it facilitates transcription. The transcription start site (TSS) is located downstream of the promoter sequence and is defined as the location in the genome where polymerase begins transcribing DNA into RNA. Knowing the location of TSSs is useful for annotation of genes, identification of non-coding sequences important to gene regulation, detection of alternative TSSs, and understanding of 5' UTR content. Several existing techniques make it possible to accurately identify TSSs, but are often difficult to perform experimentally, require large amounts of input RNA, or are unable to identify a large number of TSSs from a single sample. Many of these protocols take advantage of template switching reverse transcriptases (TSRTs), which reliably place an adaptor at the 5' end of a first strand synthesis of cDNA. Here, we introduce a protocol that exploits TSRT activity combined with rolling circle amplification to identify TSSs with several unique advantages over existing methods. Sequence adaptors are placed on the 5' and 3' end of the full-length cDNA copy of a transcript. A splint compatible with those adaptors is then used to circularize the full-length cDNA. Linear DNA containing concatemers of the cDNA are generated using rolling circle amplification, and a sequencing library is formed by fragmenting the concatemers. This protocol is straightforward to execute, requiring limited bench time with relatively stable reagents. Using extremely low amounts of RNA input, this protocol produces large numbers of accurate, deduplicated TSSs genome wide. © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Splint generation Basic Protocol 2: RNA extraction Basic Protocol 3: cDNA synthesis Basic Protocol 4: cDNA circularization and amplification Basic Protocol 5: Library generation.
Collapse
Affiliation(s)
- Andrew Murray
- Department of Plant Biology, University of Georgia, Athens, Georgia
| | - Christopher Vollmers
- Deparment of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California
| | | |
Collapse
|
6
|
Mühlen D, Li X, Dovgusha O, Jäckle H, Günesdogan U. Recycling of parental histones preserves the epigenetic landscape during embryonic development. SCIENCE ADVANCES 2023; 9:eadd6440. [PMID: 36724233 PMCID: PMC9891698 DOI: 10.1126/sciadv.add6440] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 12/27/2022] [Indexed: 06/16/2023]
Abstract
Epigenetic inheritance during DNA replication requires an orchestrated assembly of nucleosomes from parental and newly synthesized histones. We analyzed Drosophila HisC mutant embryos harboring a deletion of all canonical histone genes, in which nucleosome assembly relies on parental histones from cell cycle 14 onward. Lack of new histone synthesis leads to more accessible chromatin and reduced nucleosome occupancy, since only parental histones are available. This leads to up-regulated and spurious transcription, whereas the control of the developmental transcriptional program is partially maintained. The genomic positions of modified parental histone H2A, H2B, and H3 are largely restored during DNA replication. However, parental histones with active marks become more dispersed within gene bodies, which is linked to transcription. Together, the results suggest that parental histones are recycled to preserve the epigenetic landscape during DNA replication in vivo.
Collapse
Affiliation(s)
- Dominik Mühlen
- University of Göttingen, Göttingen Center for Molecular Biosciences, Department of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany
- Max Planck Institute for Multidisciplinary Sciences, Department for Molecular Developmental Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - Xiaojuan Li
- University of Göttingen, Göttingen Center for Molecular Biosciences, Department of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany
| | - Oleksandr Dovgusha
- University of Göttingen, Göttingen Center for Molecular Biosciences, Department of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany
| | - Herbert Jäckle
- Max Planck Institute for Multidisciplinary Sciences, Department for Molecular Developmental Biology, Am Fassberg 11, 37077 Göttingen, Germany
| | - Ufuk Günesdogan
- University of Göttingen, Göttingen Center for Molecular Biosciences, Department of Developmental Biology, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany
- Max Planck Institute for Multidisciplinary Sciences, Department for Molecular Developmental Biology, Am Fassberg 11, 37077 Göttingen, Germany
| |
Collapse
|
7
|
Murray A, Mendieta JP, Vollmers C, Schmitz RJ. Simple and accurate transcriptional start site identification using Smar2C2 and examination of conserved promoter features. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022; 112:583-596. [PMID: 36030508 PMCID: PMC9827901 DOI: 10.1111/tpj.15957] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 08/12/2022] [Accepted: 08/22/2022] [Indexed: 06/15/2023]
Abstract
The precise and accurate identification and quantification of transcriptional start sites (TSSs) is key to understanding the control of transcription. The core promoter consists of the TSS and proximal non-coding sequences, which are critical in transcriptional regulation. Therefore, the accurate identification of TSSs is important for understanding the molecular regulation of transcription. Existing protocols for TSS identification are challenging and expensive, leaving high-quality data available for a small subset of organisms. This sparsity of data impairs study of TSS usage across tissues or in an evolutionary context. To address these shortcomings, we developed Smart-Seq2 Rolling Circle to Concatemeric Consensus (Smar2C2), which identifies and quantifies TSSs and transcription termination sites. Smar2C2 incorporates unique molecular identifiers that allowed for the identification of as many as 70 million sites, with no known upper limit. We have also generated TSS data sets from as little as 40 pg of total RNA, which was the smallest input tested. In this study, we used Smar2C2 to identify TSSs in Glycine max (soybean), Oryza sativa (rice), Sorghum bicolor (sorghum), Triticum aestivum (wheat) and Zea mays (maize) across multiple tissues. This wide panel of plant TSSs facilitated the identification of evolutionarily conserved features, such as novel patterns in the dinucleotides that compose the initiator element (Inr), that correlated with promoter expression levels across all species examined. We also discovered sequence variations in known promoter motifs that are positioned reliably close to the TSS, such as differences in the TATA box and in the Inr that may prove significant to our understanding and control of transcription initiation. Smar2C2 allows for the easy study of these critical sequences, providing a tool to facilitate discovery.
Collapse
Affiliation(s)
- Andrew Murray
- Department of Plant BiologyUniversity of GeorgiaAthensGA30602USA
| | | | - Chris Vollmers
- Deparment of Biomolecular EngineeringUniversity of California Santa CruzSanta CruzCA95064USA
| | | |
Collapse
|
8
|
Chou SP, Alexander AK, Rice EJ, Choate LA, Danko CG. Genetic dissection of the RNA polymerase II transcription cycle. eLife 2022; 11:78458. [PMID: 35775732 PMCID: PMC9286732 DOI: 10.7554/elife.78458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 06/30/2022] [Indexed: 11/20/2022] Open
Abstract
How DNA sequence affects the dynamics and position of RNA Polymerase II (Pol II) during transcription remains poorly understood. Here, we used naturally occurring genetic variation in F1 hybrid mice to explore how DNA sequence differences affect the genome-wide distribution of Pol II. We measured the position and orientation of Pol II in eight organs collected from heterozygous F1 hybrid mice using ChRO-seq. Our data revealed a strong genetic basis for the precise coordinates of transcription initiation and promoter proximal pause, allowing us to redefine molecular models of core transcriptional processes. Our results implicate DNA sequence, including both known and novel DNA sequence motifs, as key determinants of the position of Pol II initiation and pause. We report evidence that initiation site selection follows a stochastic process similar to Brownian motion along the DNA template. We found widespread differences in the position of transcription termination, which impact the primary structure and stability of mature mRNA. Finally, we report evidence that allelic changes in transcription often affect mRNA and ncRNA expression across broad genomic domains. Collectively, we reveal how DNA sequences shape core transcriptional processes at single nucleotide resolution in mammals.
Collapse
Affiliation(s)
- Shao-Pei Chou
- Baker Institute for Animal Health, Cornell University, Ithaca, United States
| | - Adriana K Alexander
- Baker Institute for Animal Health, Cornell University, Ithaca, United States
| | - Edward J Rice
- Baker Institute for Animal Health, Cornell University, Ithaca, United States
| | - Lauren A Choate
- Baker Institute for Animal Health, Cornell University, Ithaca, United States
| | - Charles G Danko
- Baker Institute for Animal Health, Cornell University, Ithaca, United States
| |
Collapse
|
9
|
Abstract
Transcription start site (TSS) usage is a critical factor in the regulation of gene expression. A number of methods for global TSS mapping have been developed, but barriers of expense, technical difficulty, time, and/or cost have limited their broader adoption. To address these issues, we developed Survey of TRanscription Initiation at Promoter Elements with high-throughput sequencing (STRIPE-seq). Requiring only three enzymatic steps with intervening bead cleanups, a STRIPE-seq library can be prepared from as little as 50 ng total RNA in ~5 h at a cost of ~$12 (US). In addition to profiling TSS usage, STRIPE-seq provides information on transcript levels that can be used for differential expression analysis. Thanks to its simplicity and low cost, we envision that STRIPE-seq could be employed by any molecular biology laboratory interested in profiling transcription initiation.
Collapse
Affiliation(s)
| | - Gabriel E Zentner
- Department of Biology, Indiana University, Bloomington, IN, USA.
- Indiana University Melvin and Bren Simon Comprehensive Cancer Center, Indianapolis, IN, USA.
- eGenesis, Inc., Cambridge, MA, USA.
| |
Collapse
|
10
|
Lu Z, Berry K, Hu Z, Zhan Y, Ahn TH, Lin Z. TSSr: an R package for comprehensive analyses of TSS sequencing data. NAR Genom Bioinform 2021; 3:lqab108. [PMID: 34805991 PMCID: PMC8598296 DOI: 10.1093/nargab/lqab108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 10/05/2021] [Accepted: 10/27/2021] [Indexed: 12/13/2022] Open
Abstract
Transcription initiation is regulated in a highly organized fashion to ensure proper cellular functions. Accurate identification of transcription start sites (TSSs) and quantitative characterization of transcription initiation activities are fundamental steps for studies of regulated transcriptions and core promoter structures. Several high-throughput techniques have been developed to sequence the very 5'end of RNA transcripts (TSS sequencing) on the genome scale. Bioinformatics tools are essential for processing, analysis, and visualization of TSS sequencing data. Here, we present TSSr, an R package that provides rich functions for mapping TSS and characterizations of structures and activities of core promoters based on all types of TSS sequencing data. Specifically, TSSr implements several newly developed algorithms for accurately identifying TSSs from mapped sequencing reads and inference of core promoters, which are a prerequisite for subsequent functional analyses of TSS data. Furthermore, TSSr also enables users to export various types of TSS data that can be visualized by genome browser for inspection of promoter activities in association with other genomic features, and to generate publication-ready TSS graphs. These user-friendly features could greatly facilitate studies of transcription initiation based on TSS sequencing data. The source code and detailed documentations of TSSr can be freely accessed at https://github.com/Linlab-slu/TSSr.
Collapse
Affiliation(s)
- Zhaolian Lu
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Keenan Berry
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Zhenbin Hu
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Yu Zhan
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Tae-Hyuk Ahn
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| |
Collapse
|
11
|
Abstract
Transcription start site (TSS) selection influences transcript stability and translation as well as protein sequence. Alternative TSS usage is pervasive in organismal development, is a major contributor to transcript isoform diversity in humans, and is frequently observed in human diseases including cancer. In this review, we discuss the breadth of techniques that have been used to globally profile TSSs and the resulting insights into gene regulation, as well as future prospects in this area of inquiry.
Collapse
Affiliation(s)
| | - Gabriel E. Zentner
- Department of Biology, Indiana University, Bloomington, IN 47401, USA
- Indiana University Melvin and Bren Simon Comprehensive Cancer Center, Indianapolis, IN 46202, USA
| |
Collapse
|