1
|
He H, Yang M, Li S, Zhang G, Ding Z, Zhang L, Shi G, Li Y. Mechanisms and biotechnological applications of transcription factors. Synth Syst Biotechnol 2023; 8:565-577. [PMID: 37691767 PMCID: PMC10482752 DOI: 10.1016/j.synbio.2023.08.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2023] [Revised: 08/15/2023] [Accepted: 08/27/2023] [Indexed: 09/12/2023] Open
Abstract
Transcription factors play an indispensable role in maintaining cellular viability and finely regulating complex internal metabolic networks. These crucial bioactive functions rely on their ability to respond to effectors and concurrently interact with binding sites. Recent advancements have brought innovative insights into the understanding of transcription factors. In this review, we comprehensively summarize the mechanisms by which transcription factors carry out their functions, along with calculation and experimental-based methods employed in their identification. Additionally, we highlight recent achievements in the application of transcription factors in various biotechnological fields, including cell engineering, human health, and biomanufacturing. Finally, the current limitations of research and provide prospects for future investigations are discussed. This review will provide enlightening theoretical guidance for transcription factors engineering.
Collapse
Affiliation(s)
- Hehe He
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Mingfei Yang
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Siyu Li
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Gaoyang Zhang
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Zhongyang Ding
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Liang Zhang
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Guiyang Shi
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| | - Youran Li
- Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, Jiangsu Province 214122, PR China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
- Jiangsu Provisional Research Center for Bioactive Product Processing Technology, Jiangnan University, 1800 Lihu Avenue, Wuxi, Jiangsu Province 214122, PR China
| |
Collapse
|
2
|
Yen A, Mateusiak C, Sarafinovska S, Gachechiladze MA, Guo J, Chen X, Moudgil A, Cammack AJ, Hoisington-Lopez J, Crosby M, Brent MR, Mitra RD, Dougherty JD. Calling Cards: A Customizable Platform to Longitudinally Record Protein-DNA Interactions Over Time in Cells and Tissues. Curr Protoc 2023; 3:e883. [PMID: 37755132 PMCID: PMC10627244 DOI: 10.1002/cpz1.883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/28/2023]
Abstract
Calling Cards is a platform technology to record a cumulative history of transient protein-DNA interactions in the genome of genetically targeted cell types. The record of these interactions is recovered by next-generation sequencing. Compared with other genomic assays, readouts of which provide a snapshot at the time of harvest, Calling Cards enables correlation of historical molecular states to eventual outcomes or phenotypes. To achieve this, Calling Cards uses the piggyBac transposase to insert self-reporting transposon "Calling Cards" into the genome, leaving permanent marks at interaction sites. Calling Cards can be deployed in a variety of in vitro and in vivo biological systems to study gene regulatory networks involved in development, aging, and disease. Out of the box, it assesses enhancer usage but can be adapted to profile-specific transcription factor (TF) binding with custom TF-piggyBac fusion proteins. The Calling Cards workflow has five main stages: delivery of Calling Cards reagents, sample preparation, library preparation, sequencing, and data analysis. Here, we first present a comprehensive guide for experimental design, reagent selection, and optional customization of the platform to study additional TFs. Then, we provide an updated protocol for the five steps, using reagents that improve throughput and decrease costs, including an overview of a newly deployed computational pipeline. This protocol is designed for users with basic molecular biology experience to process samples into sequencing libraries in 2 days. Familiarity with bioinformatic analysis and command line tools is required to set up the pipeline in a high-performance computing environment and to conduct downstream analyses. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Preparation and delivery of Calling Cards reagents Support Protocol 1: Next-generation sequencing quantification of barcode distribution within self-reporting transposon plasmid pool and adeno-associated virus genome Basic Protocol 2: Sample collection and RNA purification Support Protocol 2: Library density quantitative PCR Basic Protocol 3: Sequencing library preparation Basic Protocol 4: Library pooling and sequencing Basic Protocol 5: Data analysis.
Collapse
Affiliation(s)
- Allen Yen
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
| | - Chase Mateusiak
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Simona Sarafinovska
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
| | - Mariam A. Gachechiladze
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
| | - Juanru Guo
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Xuhua Chen
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Arnav Moudgil
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Alexander J. Cammack
- Department of Neurology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Jessica Hoisington-Lopez
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - MariaLynn Crosby
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Michael R. Brent
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Computer Science and Engineering, Washington University, Saint Louis, MO 63130
| | - Robi D. Mitra
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Joseph D. Dougherty
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
- Intellectual and Developmental Disabilities Research Center, Washington University School of Medicine, Saint Louis, MO 63110
- Lead contact
| |
Collapse
|
3
|
Yen A, Mateusiak C, Sarafinovska S, Gachechiladze MA, Guo J, Chen X, Moudgil A, Cammack AJ, Hoisington-Lopez J, Crosby M, Brent MR, Mitra RD, Dougherty JD. Calling Cards: a customizable platform to longitudinally record protein-DNA interactions over time in cells and tissues. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.07.544098. [PMID: 37333130 PMCID: PMC10274760 DOI: 10.1101/2023.06.07.544098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Calling Cards is a platform technology to record a cumulative history of transient protein-DNA interactions in the genome of genetically targeted cell types. The record of these interactions is recovered by next generation sequencing. Compared to other genomic assays, whose readout provides a snapshot at the time of harvest, Calling Cards enables correlation of historical molecular states to eventual outcomes or phenotypes. To achieve this, Calling Cards uses the piggyBac transposase to insert self-reporting transposon (SRT) "Calling Cards" into the genome, leaving permanent marks at interaction sites. Calling Cards can be deployed in a variety of in vitro and in vivo biological systems to study gene regulatory networks involved in development, aging, and disease. Out of the box, it assesses enhancer usage but can be adapted to profile specific transcription factor binding with custom transcription factor (TF)-piggyBac fusion proteins. The Calling Cards workflow has five main stages: delivery of Calling Card reagents, sample preparation, library preparation, sequencing, and data analysis. Here, we first present a comprehensive guide for experimental design, reagent selection, and optional customization of the platform to study additional TFs. Then, we provide an updated protocol for the five steps, using reagents that improve throughput and decrease costs, including an overview of a newly deployed computational pipeline. This protocol is designed for users with basic molecular biology experience to process samples into sequencing libraries in 1-2 days. Familiarity with bioinformatic analysis and command line tools is required to set up the pipeline in a high-performance computing environment and to conduct downstream analyses. Basic Protocol 1: Preparation and delivery of Calling Cards reagentsBasic Protocol 2: Sample preparationBasic Protocol 3: Sequencing library preparationBasic Protocol 4: Library pooling and sequencingBasic Protocol 5: Data analysis.
Collapse
Affiliation(s)
- Allen Yen
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
| | - Chase Mateusiak
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Simona Sarafinovska
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
| | - Mariam A Gachechiladze
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
| | - Juanru Guo
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Xuhua Chen
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Arnav Moudgil
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Alexander J Cammack
- Department of Neurology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Jessica Hoisington-Lopez
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - MariaLynn Crosby
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Michael R Brent
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Computer Science and Engineering, Washington University, Saint Louis, MO 63130
| | - Robi D Mitra
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO 63110
| | - Joseph D Dougherty
- Department of Genetics, Washington University School of Medicine, Saint Louis, MO 63110
- Department of Psychiatry, Washington University School of Medicine, Saint Louis, MO 63110
- Intellectual and Developmental Disabilities Research Center, Washington University School of Medicine, Saint Louis, MO 63110
- Lead contact
| |
Collapse
|
4
|
Abid D, Brent MR. NetProphet 3: a machine learning framework for transcription factor network mapping and multi-omics integration. Bioinformatics 2023; 39:7000334. [PMID: 36692138 PMCID: PMC9912366 DOI: 10.1093/bioinformatics/btad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 01/11/2023] [Accepted: 01/18/2023] [Indexed: 01/25/2023] Open
Abstract
MOTIVATION Many methods have been proposed for mapping the targets of transcription factors (TFs) from gene expression data. It is known that combining outputs from multiple methods can improve performance. To date, outputs have been combined by using either simplistic formulae, such as geometric mean, or carefully hand-tuned formulae that may not generalize well to new inputs. Finally, the evaluation of accuracy has been challenging due to the lack of genome-scale, ground-truth networks. RESULTS We developed NetProphet3, which combines scores from multiple analyses automatically, using a tree boosting algorithm trained on TF binding location data. We also developed three independent, genome-scale evaluation metrics. By these metrics, NetProphet3 is more accurate than other commonly used packages, including NetProphet 2.0, when gene expression data from direct TF perturbations are available. Furthermore, its integration mode can forge a consensus network from gene expression data and TF binding location data. AVAILABILITY AND IMPLEMENTATION All data and code are available at https://zenodo.org/record/7504131#.Y7Wu3i-B2x8. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dhoha Abid
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, MO 63130, USA
| | - Michael R Brent
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, MO 63130, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110, USA
| |
Collapse
|
5
|
A viral toolkit for recording transcription factor-DNA interactions in live mouse tissues. Proc Natl Acad Sci U S A 2020; 117:10003-10014. [PMID: 32300008 DOI: 10.1073/pnas.1918241117] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Transcription factors (TFs) enact precise regulation of gene expression through site-specific, genome-wide binding. Common methods for TF-occupancy profiling, such as chromatin immunoprecipitation, are limited by requirement of TF-specific antibodies and provide only end-point snapshots of TF binding. Alternatively, TF-tagging techniques, in which a TF is fused to a DNA-modifying enzyme that marks TF-binding events across the genome as they occur, do not require TF-specific antibodies and offer the potential for unique applications, such as recording of TF occupancy over time and cell type specificity through conditional expression of the TF-enzyme fusion. Here, we create a viral toolkit for one such method, calling cards, and demonstrate that these reagents can be delivered to the live mouse brain and used to report TF occupancy. Further, we establish a Cre-dependent calling cards system and, in proof-of-principle experiments, show utility in defining cell type-specific TF profiles and recording and integrating TF-binding events across time. This versatile approach will enable unique studies of TF-mediated gene regulation in live animal models.
Collapse
|
6
|
Kang Y, Patel NR, Shively C, Recio PS, Chen X, Wranik BJ, Kim G, McIsaac RS, Mitra R, Brent MR. Dual threshold optimization and network inference reveal convergent evidence from TF binding locations and TF perturbation responses. Genome Res 2020; 30:459-471. [PMID: 32060051 PMCID: PMC7111528 DOI: 10.1101/gr.259655.119] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2019] [Accepted: 02/11/2020] [Indexed: 12/22/2022]
Abstract
A high-confidence map of the direct, functional targets of each transcription factor (TF) requires convergent evidence from independent sources. Two significant sources of evidence are TF binding locations and the transcriptional responses to direct TF perturbations. Systematic data sets of both types exist for yeast and human, but they rarely converge on a common set of direct, functional targets for a TF. Even the few genes that are both bound and responsive may not be direct functional targets. Our analysis shows that when there are many nonfunctional binding sites and many indirect targets, nonfunctional sites are expected to occur in the cis-regulatory DNA of indirect targets by chance. To address this problem, we introduce dual threshold optimization (DTO), a new method for setting significance thresholds on binding and perturbation-response data, and show that it improves convergence. It also enables comparison of binding data to perturbation-response data that have been processed by network inference algorithms, which further improves convergence. The combination of dual threshold optimization and network inference greatly expands the high-confidence TF network map in both yeast and human. Next, we analyze a comprehensive new data set measuring the transcriptional response shortly after inducing overexpression of a yeast TF. We also present a new yeast binding location data set obtained by transposon calling cards and compare it to recent ChIP-exo data. These new data sets improve convergence and expand the high-confidence network synergistically.
Collapse
Affiliation(s)
- Yiming Kang
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| | - Nikhil R Patel
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| | - Christian Shively
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Pamela Samantha Recio
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Xuhua Chen
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Bernd J Wranik
- Calico Life Sciences LLC, South San Francisco, California 94080, USA
| | - Griffin Kim
- Calico Life Sciences LLC, South San Francisco, California 94080, USA
| | - R Scott McIsaac
- Calico Life Sciences LLC, South San Francisco, California 94080, USA
| | - Robi Mitra
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Genetics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | - Michael R Brent
- Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63110, USA.,Department of Computer Science and Engineering, Washington University, St. Louis, Missouri 63130, USA
| |
Collapse
|
7
|
Homotypic cooperativity and collective binding are determinants of bHLH specificity and function. Proc Natl Acad Sci U S A 2019; 116:16143-16152. [PMID: 31341088 DOI: 10.1073/pnas.1818015116] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Eukaryotic cells express transcription factor (TF) paralogues that bind to nearly identical DNA sequences in vitro but bind at different genomic loci and perform different functions in vivo. Predicting how 2 paralogous TFs bind in vivo using DNA sequence alone is an important open problem. Here, we analyzed 2 yeast bHLH TFs, Cbf1p and Tye7p, which have highly similar binding preferences in vitro, yet bind at almost completely nonoverlapping target loci in vivo. We dissected the determinants of specificity for these 2 proteins by making a number of chimeric TFs in which we swapped different domains of Cbf1p and Tye7p and determined the effects on in vivo binding and cellular function. From these experiments, we learned that the Cbf1p dimer achieves its specificity by binding cooperatively with other Cbf1p dimers bound nearby. In contrast, we found that Tye7p achieves its specificity by binding cooperatively with 3 other DNA-binding proteins, Gcr1p, Gcr2p, and Rap1p. Remarkably, most promoters (63%) that are bound by Tye7p do not contain a consensus Tye7p binding site. Using this information, we were able to build simple models to accurately discriminate bound and unbound genomic loci for both Cbf1p and Tye7p. We then successfully reprogrammed the human bHLH NPAS2 to bind Cbf1p in vivo targets and a Tye7p target intergenic region to be bound by Cbf1p. These results demonstrate that the genome-wide binding targets of paralogous TFs can be discriminated using sequence information, and provide lessons about TF specificity that can be applied across the phylogenetic tree.
Collapse
|
8
|
O Cdc7 kinase where art thou? Curr Genet 2017; 64:677-680. [PMID: 29134273 DOI: 10.1007/s00294-017-0782-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 11/07/2017] [Accepted: 11/09/2017] [Indexed: 10/18/2022]
Abstract
Although Cdc7 protein kinase is important for regulating DNA replication in all eukaryotes and is a target for cancer therapy, it has never been localized in cells. Recently, a novel molecular genomic method used by our laboratory to localize Cdc7 to regions of chromosomes. Originally, mutations in the CDC7 gene were found in the classic cdc mutant collection of Hartwell et al. (Genetics 74:267-286, 1973). The CDC7 gene was found to encode a protein kinase called DDK that has been studied for many years, establishing its precise role in the initiation of DNA replication at origins. Recently, clinical studies are underway with DDK inhibitors against DDK in cancer patients. However, the conundrum is that Cdc7 has never been detected at origins of replication even though many studies have suggested it should be there. We used "Calling Card" system in which DNA binding proteins are localized to the genome via retrotransposon insertion and deep-sequencing methods. We have shown that Cdc7 localizes at many regions of the genome and was enriched at functional origins of replication. These results are consistent with DDK's role in many additional genomic processes including mutagenesis, chromatid cohesion, and meiotic recombination. Thus, the main conclusion from our studies is that Cdc7 kinase is found at many locations in the genome, but is enriched at functional origins of replication. Furthermore, we propose that application of the Calling Card system to other eukaryotes should be useful in identification of functional origins in other eukaryotic cells.
Collapse
|
9
|
Localization of Cdc7 Protein Kinase During DNA Replication in Saccharomyces cerevisiae. G3-GENES GENOMES GENETICS 2017; 7:3757-3774. [PMID: 28924058 PMCID: PMC5677158 DOI: 10.1534/g3.117.300223] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
DDK, a conserved serine-threonine protein kinase composed of a regulatory subunit, Dbf4, and a catalytic subunit, Cdc7, is essential for DNA replication initiation during S phase of the cell cycle through MCM2-7 helicase phosphorylation. The biological significance of DDK is well characterized, but the full mechanism of how DDK associates with substrates remains unclear. Cdc7 is bound to chromatin in the Saccharomyces cerevisiae genome throughout the cell cycle, but there is little empirical evidence as to specific Cdc7 binding locations. Using biochemical and genetic techniques, this study investigated the specific localization of Cdc7 on chromatin. The Calling Cards method, using Ty5 retrotransposons as a marker for DNA–protein binding, suggests Cdc7 kinase is preferentially bound to genomic DNA known to replicate early in S phase, including centromeres and origins of replication. We also discovered Cdc7 binding throughout the genome, which may be necessary to initiate other cellular processes, including meiotic recombination and translesion synthesis. A kinase dead Cdc7 point mutation increases the Ty5 retrotransposon integration efficiency and a 55-amino acid C-terminal truncation of Cdc7, unable to bind Dbf4, reduces Cdc7 binding suggesting a requirement for Dbf4 to stabilize Cdc7 on chromatin during S phase. Chromatin immunoprecipitation demonstrates that Cdc7 binding near specific origins changes during S phase. Our results suggest a model where Cdc7 is loosely bound to chromatin during G1. At the G1/S transition, Cdc7 binding to chromatin is increased and stabilized, preferentially at sites that may become origins, in order to carry out a variety of cellular processes.
Collapse
|
10
|
Qi X, Daily K, Nguyen K, Wang H, Mayhew D, Rigor P, Forouzan S, Johnston M, Mitra RD, Baldi P, Sandmeyer S. Retrotransposon profiling of RNA polymerase III initiation sites. Genome Res 2012; 22:681-92. [PMID: 22287102 DOI: 10.1101/gr.131219.111] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Although retroviruses are relatively promiscuous in choice of integration sites, retrotransposons can display marked integration specificity. In yeast and slime mold, some retrotransposons are associated with tRNA genes (tDNAs). In the Saccharomyces cerevisiae genome, the long terminal repeat retrotransposon Ty3 is found at RNA polymerase III (Pol III) transcription start sites of tDNAs. Ty1, 2, and 4 elements also cluster in the upstream regions of these genes. To determine the extent to which other Pol III-transcribed genes serve as genomic targets for Ty3, a set of 10,000 Ty3 genomic retrotranspositions were mapped using high-throughput DNA sequencing. Integrations occurred at all known tDNAs, two tDNA relics (iYGR033c and ZOD1), and six non-tDNA, Pol III-transcribed types of genes (RDN5, SNR6, SNR52, RPR1, RNA170, and SCR1). Previous work in vitro demonstrated that the Pol III transcription factor (TF) IIIB is important for Ty3 targeting. However, seven loci that bind the TFIIIB loader, TFIIIC, were not targeted, underscoring the unexplained absence of TFIIIB at those sites. Ty3 integrations also occurred in two open reading frames not previously associated with Pol III transcription, suggesting the existence of a small number of additional sites in the yeast genome that interact with Pol III transcription complexes.
Collapse
Affiliation(s)
- Xiaojie Qi
- Department of Biological Chemistry, School of Medicine, University of California, Irvine, California 92697, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
11
|
Wang H, Mayhew D, Chen X, Johnston M, Mitra RD. Calling Cards enable multiplexed identification of the genomic targets of DNA-binding proteins. Genome Res 2011; 21:748-55. [PMID: 21471402 DOI: 10.1101/gr.114850.110] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Transcription factors direct gene expression, so there is much interest in mapping their genome-wide binding locations. Current methods do not allow for the multiplexed analysis of TF binding, and this limits their throughput. We describe a novel method for determining the genomic target genes of multiple transcription factors simultaneously. DNA-binding proteins are endowed with the ability to direct transposon insertions into the genome near to where they bind. The transposon becomes a "Calling Card" marking the visit of the DNA-binding protein to that location. A unique sequence "barcode" in the transposon matches it to the DNA-binding protein that directed its insertion. The sequences of the DNA flanking the transposon (which reveal where in the genome the transposon landed) and the barcode within the transposon (which identifies the TF that put it there) are determined by massively parallel DNA sequencing. To demonstrate the method's feasibility, we determined the genomic targets of eight transcription factors in a single experiment. The Calling Card method promises to significantly reduce the cost and labor needed to determine the genomic targets of many transcription factors in different environmental conditions and genetic backgrounds.
Collapse
Affiliation(s)
- Haoyi Wang
- Department of Genetics and Center for Genome Sciences and Systems Biology, Washington University, School of Medicine, St. Louis, Missouri 63108, USA
| | | | | | | | | |
Collapse
|
12
|
Ferris AL, Wu X, Hughes CM, Stewart C, Smith SJ, Milne TA, Wang GG, Shun MC, Allis CD, Engelman A, Hughes SH. Lens epithelium-derived growth factor fusion proteins redirect HIV-1 DNA integration. Proc Natl Acad Sci U S A 2010; 107:3135-40. [PMID: 20133638 PMCID: PMC2840313 DOI: 10.1073/pnas.0914142107] [Citation(s) in RCA: 117] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Lens epithelium-derived growth factor (LEDGF) fusion proteins can direct HIV-1 DNA integration to novel sites in the host genome. The C terminus of LEDGF contains an integrase binding domain (IBD), and the N terminus binds chromatin. LEDGF normally directs integrations to the bodies of expressed genes. Replacing the N terminus of LEDGF with chromatin binding domains (CBDs) from other proteins changes the specificity of HIV-1 DNA integration. We chose two well-characterized CBDs: the plant homeodomain (PHD) finger from ING2 and the chromodomain from heterochromatin binding protein 1alpha (HP1alpha). The ING2 PHD finger binds H3K4me3, a histone mark that is associated with the transcriptional start sites of expressed genes. The HP1alpha chromodomain binds H3K9me2,3, histone marks that are widely distributed throughout the genome. A fusion protein in which the ING2 PHD finger was linked to the LEDGF IBD directed integrations near the start sites of expressed genes. A similar fusion protein in which the HP1alpha chromodomain was linked to the LEDGF IBD directed integrations to sites that differed from both the PHD finger fusion-directed and LEDGF-directed integration sites. The ability to redirect HIV-1 DNA integration may help solve the problems associated with the activation of oncogenes when retroviruses are used in gene therapy.
Collapse
Affiliation(s)
- Andrea L. Ferris
- HIV Drug Resistance Program, National Cancer Institute, Frederick, MD 21702
| | - Xiaolin Wu
- Laboratory of Molecular Technology, SAIC-Frederick, Inc., Frederick, MD 21702
| | - Christina M. Hughes
- Laboratory of Chromatin Biology and Epigenetics, The Rockefeller University, New York, NY 10065; and
| | - Claudia Stewart
- Laboratory of Molecular Technology, SAIC-Frederick, Inc., Frederick, MD 21702
| | - Steven J. Smith
- HIV Drug Resistance Program, National Cancer Institute, Frederick, MD 21702
| | - Thomas A. Milne
- Laboratory of Chromatin Biology and Epigenetics, The Rockefeller University, New York, NY 10065; and
| | - Gang G. Wang
- Laboratory of Chromatin Biology and Epigenetics, The Rockefeller University, New York, NY 10065; and
| | - Ming-Chieh Shun
- Department of Cancer Immunology and AIDS, Dana-Farber Cancer Institute, Boston, MA 02115
| | - C. David Allis
- Laboratory of Chromatin Biology and Epigenetics, The Rockefeller University, New York, NY 10065; and
| | - Alan Engelman
- Department of Cancer Immunology and AIDS, Dana-Farber Cancer Institute, Boston, MA 02115
| | - Stephen H. Hughes
- HIV Drug Resistance Program, National Cancer Institute, Frederick, MD 21702
| |
Collapse
|