1
|
Brigant B, Metzinger-Le Meuth V, Boyartchuk V, Ouled-Haddou H, Guerrera IC, Rochette J, Metzinger L. A proteomic study of the downregulation of TRIM37 on chondrocytes: Implications for the MULIBREY syndrome. Bone 2024; 187:117205. [PMID: 39019132 DOI: 10.1016/j.bone.2024.117205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 07/12/2024] [Accepted: 07/14/2024] [Indexed: 07/19/2024]
Abstract
MULIBREY nanism which results from autosomal recessive mutations in TRIM37 impacts skeletal development, leading to growth delay with complications in multiple organs. In this study, we employed a combined proteomics and qPCR screening approach to investigate the molecular alterations in the CHON-002 cell line by comparing CHON-002 wild-type (WT) cells to CHON-002 TRIM37 knockdown (KD) cells. Our proteomic analysis demonstrated that TRIM37 depletion predominantly affects the expression of extracellular matrix proteins (ECM). Specifically, nanoLC-MS/MS experiments revealed an upregulation of SPARC, and collagen products (COL1A1, COL3A1, COL5A1) in response to TRIM37 KD. Concurrently, large-scale qPCR assays targeting osteogenesis-related genes corroborated these dysregulations of SPARC at the mRNA level. Gene ontology enrichment analysis highlighted the involvement of dysregulated proteins in ECM organization and TGF-β signaling pathways, indicating a role for TRIM37 in maintaining ECM integrity and regulating chondrocyte proliferation. These findings suggest that TRIM37 deficiency in chondrocytes change ECM protein composition and could impairs long bone growth, contributing to the pathophysiology of MULIBREY nanism.
Collapse
Affiliation(s)
- Benjamin Brigant
- HEMATIM UR-UPJV 4666, C.U.R.S, University of Picardie Jules Verne, 80000 Amiens, France; Centre of Molecular Inflammation Research (CEMIR), Department of Clinical Research and Molecular Medicine (IKOM), Faculty of Medicine and Health Sciences (MH), Norwegian University of Science and Technology (NTNU), Trondheim, Norway.
| | - Valérie Metzinger-Le Meuth
- INSERM UMRS 1148, Laboratory for Vascular Translational Science (LVTS), UFR SMBH, University of Sorbonne Paris Nord, 93000 Bobigny, France
| | - Victor Boyartchuk
- Centre of Molecular Inflammation Research (CEMIR), Department of Clinical Research and Molecular Medicine (IKOM), Faculty of Medicine and Health Sciences (MH), Norwegian University of Science and Technology (NTNU), Trondheim, Norway; Surgery Clinic, St. Olav's Hospital HF, Trondheim, Norway; Centre for Integrative Genetics, Department of Animal and Aquacultural Sciences, Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
| | - Hakim Ouled-Haddou
- HEMATIM UR-UPJV 4666, C.U.R.S, University of Picardie Jules Verne, 80000 Amiens, France
| | - Ida Chiara Guerrera
- Proteomics Platform Necker, Université Paris Cité-Structure Fédérative de Recherche Necker, INSERM US24/CNRS UAR3633, 75015, Paris, France
| | - Jacques Rochette
- HEMATIM UR-UPJV 4666, C.U.R.S, University of Picardie Jules Verne, 80000 Amiens, France
| | - Laurent Metzinger
- HEMATIM UR-UPJV 4666, C.U.R.S, University of Picardie Jules Verne, 80000 Amiens, France.
| |
Collapse
|
2
|
Harracksingh AN, Singh A, Mayorova TD, Bejoy B, Hornbeck J, Elkhatib W, McEdwards G, Gauberg J, Taha A, Islam IM, Erclik T, Currie MA, Noyes M, Senatore A. Mint/X11 PDZ domains from non-bilaterian animals recognize and bind Ca V2 calcium channel C-termini in vitro. Sci Rep 2024; 14:21615. [PMID: 39284887 PMCID: PMC11405698 DOI: 10.1038/s41598-024-70652-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Accepted: 08/20/2024] [Indexed: 09/22/2024] Open
Abstract
PDZ domain mediated interactions with voltage-gated calcium (CaV) channel C-termini play important roles in localizing membrane Ca2+ signaling. The first such interaction was described between the scaffolding protein Mint-1 and CaV2.2 in mammals. In this study, we show through various in silico analyses that Mint is an animal-specific gene with a highly divergent N-terminus but a strongly conserved C-terminus comprised of a phosphotyrosine binding domain, two tandem PDZ domains (PDZ-1 and PDZ-2), and a C-terminal auto-inhibitory element that binds and inhibits PDZ-1. In addition to CaV2 chanels, most genes that interact with Mint are also deeply conserved including amyloid precursor proteins, presenilins, neurexin, and CASK and Veli which form a tripartite complex with Mint in bilaterians. Through yeast and bacterial 2-hybrid experiments, we show that Mint and CaV2 channels from cnidarians and placozoans interact in vitro, and in situ hybridization revealed co-expression in dissociated neurons from the cnidarian Nematostella vectensis. Unexpectedly, the Mint orthologue from the ctenophore Hormiphora californiensis strongly bound the divergent C-terminal ligands of cnidarian and placozoan CaV2 channels, despite neither the ctenophore Mint, nor the placozoan and cnidarian orthologues, binding the ctenophore CaV2 channel C-terminus. Altogether, our analyses suggest that the capacity of Mint to bind CaV2 channels predates bilaterian animals, and that evolutionary changes in CaV2 channel C-terminal sequences resulted in altered binding modalities with Mint.
Collapse
Affiliation(s)
- Alicia N Harracksingh
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Anhadvir Singh
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Tatiana D Mayorova
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
- NINDS, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Brian Bejoy
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Jillian Hornbeck
- Institute for Systems Genetics, NYU Grossman School of Medicine, 550 1st Ave, New York, NY, 10016, USA
| | - Wassim Elkhatib
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Gregor McEdwards
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Julia Gauberg
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Abdul Taha
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Ishrat Maliha Islam
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Ted Erclik
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Mark A Currie
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada
| | - Marcus Noyes
- Institute for Systems Genetics, NYU Grossman School of Medicine, 550 1st Ave, New York, NY, 10016, USA
| | - Adriano Senatore
- Department of Biology, University of Toronto Mississauga, 3359 Mississauga Road, Mississauga, ON, L5L 1C6, Canada.
| |
Collapse
|
3
|
Harracksingh AN, Singh A, Mayorova T, Bejoy B, Hornbeck J, Elkhatib W, McEdwards G, Gauberg J, Taha ARW, Islam IM, Erclik T, Currie MA, Noyes M, Senatore A. Mint/X11 PDZ domains from non-bilaterian animals recognize and bind Ca V 2 calcium channel C-termini in vitro . BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.26.582151. [PMID: 38463976 PMCID: PMC10925089 DOI: 10.1101/2024.02.26.582151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
PDZ domain mediated interactions with voltage-gated calcium (Ca V ) channel C-termini play important roles in localizing membrane Ca 2+ signaling. The first such interaction was described between the scaffolding protein Mint-1 and Ca V 2.2 in mammals. In this study, we show through various in silico analyses that Mint is an animal-specific gene with a highly divergent N-terminus but a strongly conserved C-terminus comprised of a phosphotyrosine binding domain, two tandem PDZ domains (PDZ-1 and PDZ-2), and a C-terminal auto-inhibitory element that binds and inhibits PDZ-1. In addition to Ca V 2 channels, most genes that interact with Mint are also deeply conserved including amyloid precursor proteins, presenilins, neurexin, and CASK and Veli which form a tripartite complex with Mint in bilaterians. Through yeast and bacterial 2-hybrid experiments, we show that Mint and Ca V 2 channels from cnidarians and placozoans interact in vitro , and in situ hybridization revealed co-expression in dissociated neurons from the cnidarian Nematostella vectensis . Unexpectedly, the Mint orthologue from the ctenophore Hormiphora californiensis strongly binds the divergent C-terminal ligands of cnidarian and placozoan Ca V 2 channels, despite neither the ctenophore Mint, nor the placozoan and cnidarian orthologues, binding the ctenophore Ca V 2 channel C-terminus. Altogether, our analyses suggest that the capacity of Mint to bind CaV2 channels predates pre-bilaterian animals, and that evolutionary changes in Ca V 2 channel C-terminal sequences resulted in altered binding modalities with Mint.
Collapse
|
4
|
Bonchuk AN, Georgiev PG. C2H2 proteins: Evolutionary aspects of domain architecture and diversification. Bioessays 2024; 46:e2400052. [PMID: 38873893 DOI: 10.1002/bies.202400052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Revised: 05/24/2024] [Accepted: 05/27/2024] [Indexed: 06/15/2024]
Abstract
The largest group of transcription factors in higher eukaryotes are C2H2 proteins, which contain C2H2-type zinc finger domains that specifically bind to DNA. Few well-studied C2H2 proteins, however, demonstrate their key role in the control of gene expression and chromosome architecture. Here we review the features of the domain architecture of C2H2 proteins and the likely origin of C2H2 zinc fingers. A comprehensive investigation of proteomes for the presence of proteins with multiple clustered C2H2 domains has revealed a key difference between groups of organisms. Unlike plants, transcription factors in metazoans contain clusters of C2H2 domains typically separated by a linker with the TGEKP consensus sequence. The average size of C2H2 clusters varies substantially, even between genomes of higher metazoans, and with a tendency to increase in combination with SCAN, and especially KRAB domains, reflecting the increasing complexity of gene regulatory networks.
Collapse
Affiliation(s)
- Artem N Bonchuk
- Department of the Control of Genetic Processes, Institute of Gene Biology Russian Academy of Sciences, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Pavel G Georgiev
- Department of the Control of Genetic Processes, Institute of Gene Biology Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
5
|
Zhang X, Blumenthal RM, Cheng X. Keep Fingers on the CpG Islands. EPIGENOMES 2024; 8:23. [PMID: 38920624 PMCID: PMC11202855 DOI: 10.3390/epigenomes8020023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 05/31/2024] [Accepted: 06/14/2024] [Indexed: 06/27/2024] Open
Abstract
The post-genomic era has ushered in the extensive application of epigenetic editing tools, allowing for precise alterations of gene expression. The use of reprogrammable editors that carry transcriptional corepressors has significant potential for long-term epigenetic silencing for the treatment of human diseases. The ideal scenario involves precise targeting of a specific genomic location by a DNA-binding domain, ensuring there are no off-target effects and that the process yields no genetic remnants aside from specific epigenetic modifications (i.e., DNA methylation). A notable example is a recent study on the mouse Pcsk9 gene, crucial for cholesterol regulation and expressed in hepatocytes, which identified synthetic zinc-finger (ZF) proteins as the most effective DNA-binding editors for silencing Pcsk9 efficiently, specifically, and persistently. This discussion focuses on enhancing the specificity of ZF-array DNA binding by optimizing interactions between specific amino acids and DNA bases across three promoters containing CpG islands.
Collapse
Affiliation(s)
- Xing Zhang
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| | - Robert M. Blumenthal
- Department of Medical Microbiology and Immunology, and Program in Bioinformatics, The University of Toledo College of Medicine and Life Sciences, Toledo, OH 43614, USA;
| | - Xiaodong Cheng
- Department of Epigenetics and Molecular Carcinogenesis, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA;
| |
Collapse
|
6
|
Oriol F, Alberto M, Joachim AP, Patrick G, M BP, Ruben MF, Jaume B, Altair CH, Ferran P, Oriol G, Narcis FF, Baldo O. Structure-based learning to predict and model protein-DNA interactions and transcription-factor co-operativity in cis-regulatory elements. NAR Genom Bioinform 2024; 6:lqae068. [PMID: 38867914 PMCID: PMC11167492 DOI: 10.1093/nargab/lqae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 04/18/2024] [Accepted: 05/23/2024] [Indexed: 06/14/2024] Open
Abstract
Transcription factor (TF) binding is a key component of genomic regulation. There are numerous high-throughput experimental methods to characterize TF-DNA binding specificities. Their application, however, is both laborious and expensive, which makes profiling all TFs challenging. For instance, the binding preferences of ∼25% human TFs remain unknown; they neither have been determined experimentally nor inferred computationally. We introduce a structure-based learning approach to predict the binding preferences of TFs and the automated modelling of TF regulatory complexes. We show the advantage of using our approach over the classical nearest-neighbor prediction in the limits of remote homology. Starting from a TF sequence or structure, we predict binding preferences in the form of motifs that are then used to scan a DNA sequence for occurrences. The best matches are either profiled with a binding score or collected for their subsequent modeling into a higher-order regulatory complex with DNA. Co-operativity is modelled by: (i) the co-localization of TFs and (ii) the structural modeling of protein-protein interactions between TFs and with co-factors. We have applied our approach to automatically model the interferon-β enhanceosome and the pioneering complexes of OCT4, SOX2 (or SOX11) and KLF4 with a nucleosome, which are compared with the experimentally known structures.
Collapse
Affiliation(s)
- Fornes Oriol
- Centre for Molecular Medicine and Therapeutics. BC Children's Hospital Research Institute. Department of Medical Genetics. University of British Columbia, Vancouver, BC V5Z 4H4, Canada
| | - Meseguer Alberto
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | | | - Gohl Patrick
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Bota Patricia M
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Molina-Fernández Ruben
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Bonet Jaume
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
- Laboratory of Protein Design & Immunoengineering. School of Engineering. Ecole Polytechnique Federale de Lausanne. Lausanne 1015, Vaud, Switzerland
| | - Chinchilla-Hernandez Altair
- Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Pegenaute Ferran
- Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Gallego Oriol
- Live-Cell Structural Biology. Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| | - Fernandez-Fuentes Narcis
- Institute of Biological, Environmental and Rural Science. Aberystwyth University, SY23 3DA Aberystwyth, UK
| | - Oliva Baldo
- Structural Bioinformatics Lab (GRIB-IMIM). Department of Medicine and Life Sciences, Universitat Pompeu Fabra, Barcelona 08005 Catalonia, Spain
| |
Collapse
|
7
|
Yang J, Horton JR, Liu B, Corces VG, Blumenthal RM, Zhang X, Cheng X. Structures of CTCF-DNA complexes including all 11 zinc fingers. Nucleic Acids Res 2023; 51:8447-8462. [PMID: 37439339 PMCID: PMC10484683 DOI: 10.1093/nar/gkad594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 06/27/2023] [Accepted: 07/03/2023] [Indexed: 07/14/2023] Open
Abstract
The CCCTC-binding factor (CTCF) binds tens of thousands of enhancers and promoters on mammalian chromosomes by means of its 11 tandem zinc finger (ZF) DNA-binding domain. In addition to the 12-15-bp CORE sequence, some of the CTCF binding sites contain 5' upstream and/or 3' downstream motifs. Here, we describe two structures for overlapping portions of human CTCF, respectively, including ZF1-ZF7 and ZF3-ZF11 in complex with DNA that incorporates the CORE sequence together with either 3' downstream or 5' upstream motifs. Like conventional tandem ZF array proteins, ZF1-ZF7 follow the right-handed twist of the DNA, with each finger occupying and recognizing one triplet of three base pairs in the DNA major groove. ZF8 plays a unique role, acting as a spacer across the DNA minor groove and positioning ZF9-ZF11 to make cross-strand contacts with DNA. We ascribe the difference between the two subgroups of ZF1-ZF7 and ZF8-ZF11 to residues at the two positions -6 and -5 within each finger, with small residues for ZF1-ZF7 and bulkier and polar/charged residues for ZF8-ZF11. ZF8 is also uniquely rich in basic amino acids, which allows salt bridges to DNA phosphates in the minor groove. Highly specific arginine-guanine and glutamine-adenine interactions, used to recognize G:C or A:T base pairs at conventional base-interacting positions of ZFs, also apply to the cross-strand interactions adopted by ZF9-ZF11. The differences between ZF1-ZF7 and ZF8-ZF11 can be rationalized structurally and may contribute to recognition of high-affinity CTCF binding sites.
Collapse
Affiliation(s)
- Jie Yang
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - John R Horton
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Bin Liu
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Victor G Corces
- Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Robert M Blumenthal
- Department of Medical Microbiology and Immunology, and Program in Bioinformatics, The University of Toledo College of Medicine and Life Sciences, Toledo, OH 43614, USA
| | - Xing Zhang
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Xiaodong Cheng
- Department of Epigenetics and Molecular Carcinogenesis, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| |
Collapse
|
8
|
Ichikawa DM, Abdin O, Alerasool N, Kogenaru M, Mueller AL, Wen H, Giganti DO, Goldberg GW, Adams S, Spencer JM, Razavi R, Nim S, Zheng H, Gionco C, Clark FT, Strokach A, Hughes TR, Lionnet T, Taipale M, Kim PM, Noyes MB. A universal deep-learning model for zinc finger design enables transcription factor reprogramming. Nat Biotechnol 2023; 41:1117-1129. [PMID: 36702896 PMCID: PMC10421740 DOI: 10.1038/s41587-022-01624-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 11/17/2022] [Indexed: 01/27/2023]
Abstract
Cys2His2 zinc finger (ZF) domains engineered to bind specific target sequences in the genome provide an effective strategy for programmable regulation of gene expression, with many potential therapeutic applications. However, the structurally intricate engagement of ZF domains with DNA has made their design challenging. Here we describe the screening of 49 billion protein-DNA interactions and the development of a deep-learning model, ZFDesign, that solves ZF design for any genomic target. ZFDesign is a modern machine learning method that models global and target-specific differences induced by a range of library environments and specifically takes into account compatibility of neighboring fingers using a novel hierarchical transformer architecture. We demonstrate the versatility of designed ZFs as nucleases as well as activators and repressors by seamless reprogramming of human transcription factors. These factors could be used to upregulate an allele of haploinsufficiency, downregulate a gain-of-function mutation or test the consequence of regulation of a single gene as opposed to the many genes that a transcription factor would normally influence.
Collapse
Affiliation(s)
- David M Ichikawa
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
- Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, USA
| | - Osama Abdin
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Nader Alerasool
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Manjunatha Kogenaru
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - April L Mueller
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Han Wen
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - David O Giganti
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Gregory W Goldberg
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Samantha Adams
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Jeffrey M Spencer
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Rozita Razavi
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Satra Nim
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Hong Zheng
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Courtney Gionco
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Finnegan T Clark
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Alexey Strokach
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada
| | - Timothy R Hughes
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Timothee Lionnet
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA
| | - Mikko Taipale
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Philip M Kim
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
| | - Marcus B Noyes
- Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
- Department of Biochemistry and Molecular Pharmacology, NYU Grossman School of Medicine, New York, NY, USA.
| |
Collapse
|
9
|
Wetzel JL, Zhang K, Singh M. Learning probabilistic protein-DNA recognition codes from DNA-binding specificities using structural mappings. Genome Res 2022; 32:1776-1786. [PMID: 36123148 PMCID: PMC9528988 DOI: 10.1101/gr.276606.122] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 07/30/2022] [Indexed: 11/25/2022]
Abstract
Knowledge of how proteins interact with DNA is essential for understanding gene regulation. Although DNA-binding specificities for thousands of transcription factors (TFs) have been determined, the specific amino acid-base interactions comprising their structural interfaces are largely unknown. This lack of resolution hampers attempts to leverage these data in order to predict specificities for uncharacterized TFs or TFs mutated in disease. Here we introduce recognition code learning via automated mapping of protein-DNA structural interfaces (rCLAMPS), a probabilistic approach that uses DNA-binding specificities for TFs from the same structural family to simultaneously infer both which nucleotide positions are contacted by particular amino acids within the TF as well as a recognition code that relates each base-contacting amino acid to nucleotide preferences at the DNA positions it contacts. We apply rCLAMPS to homeodomains, the second largest family of TFs in metazoans and show that it learns a highly effective recognition code that can predict de novo DNA-binding specificities for TFs. Furthermore, we show that the inferred amino acid-nucleotide contacts reveal whether and how nucleotide preferences at individual binding site positions are altered by mutations within TFs. Our approach is an important step toward automatically uncovering the determinants of protein-DNA specificity from large compendia of DNA-binding specificities and inferring the altered functionalities of TFs mutated in disease.
Collapse
Affiliation(s)
- Joshua L Wetzel
- Department of Computer Science and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - Kaiqian Zhang
- Department of Computer Science and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| | - Mona Singh
- Department of Computer Science and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA
| |
Collapse
|
10
|
Lebeau B, Zhao K, Jangal M, Zhao T, Guerra M, Greenwood CMT, Witcher M. Single base-pair resolution analysis of DNA binding motif with MoMotif reveals an oncogenic function of CTCF zinc-finger 1 mutation. Nucleic Acids Res 2022; 50:8441-8458. [PMID: 35947648 PMCID: PMC9410893 DOI: 10.1093/nar/gkac658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Accepted: 07/21/2022] [Indexed: 12/24/2022] Open
Abstract
Defining the impact of missense mutations on the recognition of DNA motifs is highly dependent on bioinformatic tools that define DNA binding elements. However, classical motif analysis tools remain limited in their capacity to identify subtle changes in complex binding motifs between distinct conditions. To overcome this limitation, we developed a new tool, MoMotif, that facilitates a sensitive identification, at the single base-pair resolution, of complex, or subtle, alterations to core binding motifs, discerned from ChIP-seq data. We employed MoMotif to define the previously uncharacterized recognition motif of CTCF zinc-finger 1 (ZF1), and to further define the impact of CTCF ZF1 mutation on its association with chromatin. Mutations of CTCF ZF1 are exclusive to breast cancer and are associated with metastasis and therapeutic resistance, but the underlying mechanisms are unclear. Using MoMotif, we identified an extension of the CTCF core binding motif, necessitating a functional ZF1 to bind appropriately. Using a combination of ChIP-Seq and RNA-Seq, we discover that the inability to bind this extended motif drives an altered transcriptional program associated with the oncogenic phenotypes observed clinically. Our study demonstrates that MoMotif is a powerful new tool for comparative ChIP-seq analysis and characterising DNA-protein contacts.
Collapse
Affiliation(s)
| | | | - Maika Jangal
- Lady Davis Institute, Jewish General Hospital, Montréal, Québec H3T 1E2, Canada
| | - Tiejun Zhao
- Lady Davis Institute, Jewish General Hospital, Montréal, Québec H3T 1E2, Canada
| | - Maria Guerra
- Lady Davis Institute, Jewish General Hospital, Montréal, Québec H3T 1E2, Canada
| | - Celia M T Greenwood
- Correspondence may also be addressed to Celia Greenwood. Tel: +1 514 340 8222 (Ext 28397);
| | - Michael Witcher
- To whom correspondence should be addressed. Tel: +1 514 340 8222 (Ext 23363);
| |
Collapse
|
11
|
Yang H, Kolben T, Kessler M, Meister S, Paul C, van Dorp J, Eren S, Kuhn C, Rahmeh M, Herbst C, Fink SG, Weimer G, Mahner S, Jeschke U, von Schönfeldt V. FAM111A Is a Novel Molecular Marker for Oocyte Aging. Biomedicines 2022; 10:257. [PMID: 35203468 PMCID: PMC8869572 DOI: 10.3390/biomedicines10020257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 01/12/2022] [Accepted: 01/17/2022] [Indexed: 11/17/2022] Open
Abstract
Aging is the main cause of decline in oocyte quality, which can further trigger the failure of assisted reproductive technology (ART). Exploring age-related genes in oocytes is an important way to investigate the molecular mechanisms involved in oocyte aging. To provide novel insight into this field, we performed a pooled analysis of publicly available datasets, using the overlapping results of two statistical methods on two Gene Expression Omnibus (GEO) datasets. The methods utilized in the current study mainly include Spearman rank correlation, the Wilcoxon signed-rank test, t-tests, Venn diagrams, Gene Ontology (GO), Protein-Protein Interaction (PPI), Gene Set Enrichment Analysis (GSEA), Gene Set Variation Analysis (GSVA), and receiver operating characteristic (ROC) curve analysis. We identified hundreds of age-related genes across different gene expression datasets of in vitro maturation-metaphase II (IVM-MII) oocytes. Age-related genes in IVM-MII oocytes were involved in the biological processes of cellular metabolism, DNA replication, and histone modifications. Among these age-related genes, FAM111A expression presented a robust correlation with age, seen in the results of different statistical methods and different datasets. FAM111A is associated with the processes of chromosome segregation and cell cycle regulation. Thus, this enzyme is potentially an interesting novel marker for the aging of oocytes, and warrants further mechanistic study.
Collapse
Affiliation(s)
- Huixia Yang
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Thomas Kolben
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Mirjana Kessler
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Sarah Meister
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Corinna Paul
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Julia van Dorp
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Sibel Eren
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Christina Kuhn
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
- Department of Obstetrics and Gynecology, University Hospital Augsburg, 86156 Augsburg, Germany
| | - Martina Rahmeh
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Cornelia Herbst
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Sabine Gabriele Fink
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Gabriele Weimer
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Sven Mahner
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| | - Udo Jeschke
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
- Department of Obstetrics and Gynecology, University Hospital Augsburg, 86156 Augsburg, Germany
| | - Viktoria von Schönfeldt
- Department of Obstetrics and Gynecology, University Hospital, Ludwig-Maximilians-University, 81377 Munich, Germany; (H.Y.); (T.K.); (M.K.); (S.M.); (C.P.); (J.v.D.); (S.E.); (C.K.); (M.R.); (C.H.); (S.G.F.); (G.W.); (S.M.); (V.v.S.)
| |
Collapse
|
12
|
Dasgupta I, Flotte TR, Keeler AM. CRISPR/Cas-Dependent and Nuclease-Free In Vivo Therapeutic Gene Editing. Hum Gene Ther 2021; 32:275-293. [PMID: 33750221 PMCID: PMC7987363 DOI: 10.1089/hum.2021.013] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 02/27/2021] [Indexed: 12/19/2022] Open
Abstract
Precise gene manipulation by gene editing approaches facilitates the potential to cure several debilitating genetic disorders. Gene modification stimulated by engineered nucleases induces a double-stranded break (DSB) in the target genomic locus, thereby activating DNA repair mechanisms. DSBs triggered by nucleases are repaired either by the nonhomologous end-joining or the homology-directed repair pathway, enabling efficient gene editing. While there are several ongoing ex vivo genome editing clinical trials, current research underscores the therapeutic potential of CRISPR/Cas-based (clustered regularly interspaced short palindrome repeats-associated Cas nuclease) in vivo gene editing. In this review, we provide an overview of the CRISPR/Cas-mediated in vivo genome therapy applications and explore their prospective clinical translatability to treat human monogenic disorders. In addition, we discuss the various challenges associated with in vivo genome editing technologies and strategies used to circumvent them. Despite the robust and precise nuclease-mediated gene editing, a promoterless, nuclease-independent gene targeting strategy has been utilized to evade the drawbacks of the nuclease-dependent system, such as off-target effects, immunogenicity, and cytotoxicity. Thus, the rapidly evolving paradigm of gene editing technologies will continue to foster the progress of gene therapy applications.
Collapse
Affiliation(s)
- Ishani Dasgupta
- Department of Pediatrics, Horae Gene Therapy Center, University of Massachusetts, Worcester, Massachusetts, USA
| | - Terence R. Flotte
- Department of Pediatrics, Horae Gene Therapy Center, University of Massachusetts, Worcester, Massachusetts, USA
| | - Allison M. Keeler
- Department of Pediatrics, Horae Gene Therapy Center, University of Massachusetts, Worcester, Massachusetts, USA
| |
Collapse
|
13
|
Meseguer A, Årman F, Fornes O, Molina-Fernández R, Bonet J, Fernandez-Fuentes N, Oliva B. On the prediction of DNA-binding preferences of C2H2-ZF domains using structural models: application on human CTCF. NAR Genom Bioinform 2021; 2:lqaa046. [PMID: 33575598 PMCID: PMC7671317 DOI: 10.1093/nargab/lqaa046] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2020] [Revised: 05/07/2020] [Accepted: 06/10/2020] [Indexed: 12/25/2022] Open
Abstract
Cis2-His2 zinc finger (C2H2-ZF) proteins are the largest family of transcription factors in human and higher metazoans. To date, the DNA-binding preferences of many members of this family remain unknown. We have developed a computational method to predict their DNA-binding preferences. We have computed theoretical position weight matrices (PWMs) of proteins composed by C2H2-ZF domains, with the only requirement of an input structure. We have predicted more than two-third of a single zinc-finger domain binding site for about 70% variants of Zif268, a classical member of this family. We have successfully matched between 60 and 90% of the binding-site motif of examples of proteins composed by three C2H2-ZF domains in JASPAR, a standard database of PWMs. The tests are used as a proof of the capacity to scan a DNA fragment and find the potential binding sites of transcription-factors formed by C2H2-ZF domains. As an example, we have tested the approach to predict the DNA-binding preferences of the human chromatin binding factor CTCF. We offer a server to model the structure of a zinc-finger protein and predict its PWM.
Collapse
Affiliation(s)
- Alberto Meseguer
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Experimental and Health Science, University Pompeu Fabra, Barcelona, Catalonia 08005, Spain
| | - Filip Årman
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Experimental and Health Science, University Pompeu Fabra, Barcelona, Catalonia 08005, Spain
| | - Oriol Fornes
- Centre for Molecular Medicine and Therapeutics, BC Children's Hospital Research Institute, Department of Medical Genetics, University of British Columbia, Vancouver, BC V5Z 4H4, Canada
| | - Ruben Molina-Fernández
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Experimental and Health Science, University Pompeu Fabra, Barcelona, Catalonia 08005, Spain
| | - Jaume Bonet
- Laboratory of Protein Design & Immunoengineering, School of Engineering, Ecole Polytechnique Federale de Lausanne, Lausanne 1015, Vaud, Switzerland
| | - Narcis Fernandez-Fuentes
- Department of Biosciences, U Science Tech, Universitat de Vic-Universitat Central de Catalunya, Vic, Catalonia 08500, Spain
| | - Baldo Oliva
- Structural Bioinformatics Lab (GRIB-IMIM), Department of Experimental and Health Science, University Pompeu Fabra, Barcelona, Catalonia 08005, Spain
| |
Collapse
|
14
|
Goldberg GW, Spencer JM, Giganti DO, Camellato BR, Agmon N, Ichikawa DM, Boeke JD, Noyes MB. Engineered dual selection for directed evolution of SpCas9 PAM specificity. Nat Commun 2021; 12:349. [PMID: 33441553 PMCID: PMC7807044 DOI: 10.1038/s41467-020-20650-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Accepted: 11/18/2020] [Indexed: 12/26/2022] Open
Abstract
The widely used Streptococcus pyogenes Cas9 (SpCas9) nuclease derives its DNA targeting specificity from protein-DNA contacts with protospacer adjacent motif (PAM) sequences, in addition to base-pairing interactions between its guide RNA and target DNA. Previous reports have established that the PAM specificity of SpCas9 can be altered via positive selection procedures for directed evolution or other protein engineering strategies. Here we exploit in vivo directed evolution systems that incorporate simultaneous positive and negative selection to evolve SpCas9 variants with commensurate or improved activity on NAG PAMs relative to wild type and reduced activity on NGG PAMs, particularly YGG PAMs. We also show that the PAM preferences of available evolutionary intermediates effectively determine whether similar counterselection PAMs elicit different selection stringencies, and demonstrate that negative selection can be specifically increased in a yeast selection system through the fusion of compensatory zinc fingers to SpCas9.
Collapse
Affiliation(s)
- Gregory W Goldberg
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA.
| | - Jeffrey M Spencer
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA
| | - David O Giganti
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA
| | - Brendan R Camellato
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA
| | - Neta Agmon
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA
- Neochromosome, Inc., Alexandria Center for Life Science, New York, NY, 10016, USA
| | - David M Ichikawa
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA
| | - Jef D Boeke
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA
- Department of Biomedical Engineering, NYU Tandon School of Engineering, Brooklyn, NY, 11201, USA
| | - Marcus B Noyes
- Institute for Systems Genetics, NYU Langone Health, New York, NY, 10016, USA.
| |
Collapse
|
15
|
Mueller AL, Corbi-Verge C, Giganti DO, Ichikawa DM, Spencer JM, MacRae M, Garton M, Kim PM, Noyes MB. The geometric influence on the Cys2His2 zinc finger domain and functional plasticity. Nucleic Acids Res 2020; 48:6382-6402. [PMID: 32383734 PMCID: PMC7293014 DOI: 10.1093/nar/gkaa291] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 04/07/2020] [Accepted: 04/20/2020] [Indexed: 11/25/2022] Open
Abstract
The Cys2His2 zinc finger is the most common DNA-binding domain expanding in metazoans since the fungi human split. A proposed catalyst for this expansion is an arms race to silence transposable elements yet it remains poorly understood how this domain is able to evolve the required specificities. Likewise, models of its DNA binding specificity remain error prone due to a lack of understanding of how adjacent fingers influence each other's binding specificity. Here, we use a synthetic approach to exhaustively investigate binding geometry, one of the dominant influences on adjacent finger function. By screening over 28 billion protein–DNA interactions in various geometric contexts we find the plasticity of the most common natural geometry enables more functional amino acid combinations across all targets. Further, residues that define this geometry are enriched in genomes where zinc fingers are prevalent and specificity transitions would be limited in alternative geometries. Finally, these results demonstrate an exhaustive synthetic screen can produce an accurate model of domain function while providing mechanistic insight that may have assisted in the domains expansion.
Collapse
Affiliation(s)
- April L Mueller
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| | - Carles Corbi-Verge
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - David O Giganti
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| | - David M Ichikawa
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| | - Jeffrey M Spencer
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| | - Mark MacRae
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| | - Michael Garton
- Institute of Biomaterials and Biomedical Engineering, University of Toronto, Toronto, Ontario M5S 3G9, Canada
| | - Philip M Kim
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S3E1, Canada.,Department of Computer Science, University of Toronto, Toronto, Ontario M5S3E1, Canada
| | - Marcus B Noyes
- Institute for Systems Genetics, NYU Langone Health, New York, NY 10016, USA.,Department of Biochemistry and Molecular Pharmacology, NYU Langone Health, New York, NY 10016, USA
| |
Collapse
|
16
|
Ichikawa DM, Corbi-Verge C, Shen MJ, Snider J, Wong V, Stagljar I, Kim PM, Noyes MB. A Multireporter Bacterial 2-Hybrid Assay for the High-Throughput and Dynamic Assay of PDZ Domain-Peptide Interactions. ACS Synth Biol 2019; 8:918-928. [PMID: 30969105 DOI: 10.1021/acssynbio.8b00499] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The accurate determination of protein-protein interactions has been an important focus of molecular biology toward which much progress has been made due to the continuous development of existing and new technologies. However, current methods can have limitations, including scale and restriction to high affinity interactions, limiting our understanding of a large subset of these interactions. Here, we describe a modified bacterial-hybrid assay that employs combined selectable and scalable reporters that enable the sensitive screening of large peptide libraries followed by the sorting of positive interactions by the level of reporter output. We have applied this tool to characterize a set of human and E. coli PDZ domains. Our results are consistent with prior characterization of these proteins, and the improved sensitivity increases our ability to predict known and novel in vivo binding partners. This approach allows for the recovery of a wide range of affinities with a high throughput method that does not sacrifice the scale of the screen.
Collapse
Affiliation(s)
- David M. Ichikawa
- Department of Biochemistry Molecular Pharmacology and Institute for Systems Genetics, NYU Langone Health, New York, New York 10016, United States
| | - Carles Corbi-Verge
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Michael J. Shen
- Department of Biochemistry Molecular Pharmacology and Institute for Systems Genetics, NYU Langone Health, New York, New York 10016, United States
| | - Jamie Snider
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Victoria Wong
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Igor Stagljar
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Philip M. Kim
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 3E1, Canada
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 3E1, Canada
| | - Marcus B. Noyes
- Department of Biochemistry Molecular Pharmacology and Institute for Systems Genetics, NYU Langone Health, New York, New York 10016, United States
| |
Collapse
|
17
|
Higgins SA, Savage DF. Protein Science by DNA Sequencing: How Advances in Molecular Biology Are Accelerating Biochemistry. Biochemistry 2017; 57:38-46. [DOI: 10.1021/acs.biochem.7b00886] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Sean A. Higgins
- Department
of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States
| | - David F. Savage
- Department
of Molecular and Cell Biology, University of California, Berkeley, Berkeley, California 94720, United States
- Department
of Chemistry, University of California, Berkeley, Berkeley, California 94720, United States
| |
Collapse
|
18
|
The Development and Use of Zinc-Finger Nucleases. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016. [DOI: 10.1007/978-1-4939-3509-3_2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
19
|
Blackburn MC, Petrova E, Correia BE, Maerkl SJ. Integrating gene synthesis and microfluidic protein analysis for rapid protein engineering. Nucleic Acids Res 2015; 44:e68. [PMID: 26704969 PMCID: PMC4838357 DOI: 10.1093/nar/gkv1497] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2015] [Accepted: 12/09/2015] [Indexed: 11/15/2022] Open
Abstract
The capability to rapidly design proteins with novel functions will have a significant impact on medicine, biotechnology and synthetic biology. Synthetic genes are becoming a commodity, but integrated approaches have yet to be developed that take full advantage of gene synthesis. We developed a solid-phase gene synthesis method based on asymmetric primer extension (APE) and coupled this process directly to high-throughput, on-chip protein expression, purification and characterization (via mechanically induced trapping of molecular interactions, MITOMI). By completely circumventing molecular cloning and cell-based steps, APE-MITOMI reduces the time between protein design and quantitative characterization to 3–4 days. With APE-MITOMI we synthesized and characterized over 400 zinc-finger (ZF) transcription factors (TF), showing that although ZF TFs can be readily engineered to recognize a particular DNA sequence, engineering the precise binding energy landscape remains challenging. We also found that it is possible to engineer ZF–DNA affinity precisely and independently of sequence specificity and that in silico modeling can explain some of the observed affinity differences. APE-MITOMI is a generic approach that should facilitate fundamental studies in protein biophysics, and protein design/engineering.
Collapse
Affiliation(s)
- Matthew C Blackburn
- Institute of Bioengineering, School of Engineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Ekaterina Petrova
- Institute of Bioengineering, School of Engineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Bruno E Correia
- Institute of Bioengineering, School of Engineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Sebastian J Maerkl
- Institute of Bioengineering, School of Engineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| |
Collapse
|
20
|
Abstract
The ability to manipulate the genome with precise spatial and nucleotide resolution (genome editing) has been a powerful research tool. In the past decade, the tools and expertise for using genome editing in human somatic cells and pluripotent cells have increased to such an extent that the approach is now being developed widely as a strategy to treat human disease. The fundamental process depends on creating a site-specific DNA double-strand break (DSB) in the genome and then allowing the cell's endogenous DSB repair machinery to fix the break such that precise nucleotide changes are made to the DNA sequence. With the development and discovery of several different nuclease platforms and increasing knowledge of the parameters affecting different genome editing outcomes, genome editing frequencies now reach therapeutic relevance for a wide variety of diseases. Moreover, there is a series of complementary approaches to assessing the safety and toxicity of any genome editing process, irrespective of the underlying nuclease used. Finally, the development of genome editing has raised the issue of whether it should be used to engineer the human germline. Although such an approach could clearly prevent the birth of people with devastating and destructive genetic diseases, questions remain about whether human society is morally responsible enough to use this tool.
Collapse
Affiliation(s)
- Matthew Porteus
- Department of Pediatrics, Division of Stem Cell Transplantation and Regenerative Medicine, Stanford University, Stanford, California 94305;
| |
Collapse
|
21
|
Nadimpalli S, Persikov AV, Singh M. Pervasive variation of transcription factor orthologs contributes to regulatory network evolution. PLoS Genet 2015; 11:e1005011. [PMID: 25748510 PMCID: PMC4351887 DOI: 10.1371/journal.pgen.1005011] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2014] [Accepted: 01/18/2015] [Indexed: 01/17/2023] Open
Abstract
Differences in transcriptional regulatory networks underlie much of the phenotypic variation observed across organisms. Changes to cis-regulatory elements are widely believed to be the predominant means by which regulatory networks evolve, yet examples of regulatory network divergence due to transcription factor (TF) variation have also been observed. To systematically ascertain the extent to which TFs contribute to regulatory divergence, we analyzed the evolution of the largest class of metazoan TFs, Cys2-His2 zinc finger (C2H2-ZF) TFs, across 12 Drosophila species spanning ~45 million years of evolution. Remarkably, we uncovered that a significant fraction of all C2H2-ZF 1-to-1 orthologs in flies exhibit variations that can affect their DNA-binding specificities. In addition to loss and recruitment of C2H2-ZF domains, we found diverging DNA-contacting residues in ~44% of domains shared between D. melanogaster and the other fly species. These diverging DNA-contacting residues, found in ~70% of the D. melanogaster C2H2-ZF genes in our analysis and corresponding to ~26% of all annotated D. melanogaster TFs, show evidence of functional constraint: they tend to be conserved across phylogenetic clades and evolve slower than other diverging residues. These same variations were rarely found as polymorphisms within a population of D. melanogaster flies, indicating their rapid fixation. The predicted specificities of these dynamic domains gradually change across phylogenetic distances, suggesting stepwise evolutionary trajectories for TF divergence. Further, whereas proteins with conserved C2H2-ZF domains are enriched in developmental functions, those with varying domains exhibit no functional enrichments. Our work suggests that a subset of highly dynamic and largely unstudied TFs are a likely source of regulatory variation in Drosophila and other metazoans.
Collapse
Affiliation(s)
- Shilpa Nadimpalli
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Anton V. Persikov
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Mona Singh
- Department of Computer Science, Princeton University, Princeton, New Jersey, United States of America
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
22
|
Persikov AV, Wetzel JL, Rowland EF, Oakes BL, Xu DJ, Singh M, Noyes MB. A systematic survey of the Cys2His2 zinc finger DNA-binding landscape. Nucleic Acids Res 2015; 43:1965-84. [PMID: 25593323 PMCID: PMC4330361 DOI: 10.1093/nar/gku1395] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Cys2His2 zinc fingers (C2H2-ZFs) comprise the largest class of metazoan DNA-binding domains. Despite this domain's well-defined DNA-recognition interface, and its successful use in the design of chimeric proteins capable of targeting genomic regions of interest, much remains unknown about its DNA-binding landscape. To help bridge this gap in fundamental knowledge and to provide a resource for design-oriented applications, we screened large synthetic protein libraries to select binding C2H2-ZF domains for each possible three base pair target. The resulting data consist of >160 000 unique domain-DNA interactions and comprise the most comprehensive investigation of C2H2-ZF DNA-binding interactions to date. An integrated analysis of these independent screens yielded DNA-binding profiles for tens of thousands of domains and led to the successful design and prediction of C2H2-ZF DNA-binding specificities. Computational analyses uncovered important aspects of C2H2-ZF domain-DNA interactions, including the roles of within-finger context and domain position on base recognition. We observed the existence of numerous distinct binding strategies for each possible three base pair target and an apparent balance between affinity and specificity of binding. In sum, our comprehensive data help elucidate the complex binding landscape of C2H2-ZF domains and provide a foundation for efforts to determine, predict and engineer their DNA-binding specificities.
Collapse
Affiliation(s)
- Anton V Persikov
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Joshua L Wetzel
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
| | - Elizabeth F Rowland
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Benjamin L Oakes
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Denise J Xu
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Mona Singh
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA Department of Computer Science, Princeton University, Princeton, NJ 08544, USA
| | - Marcus B Noyes
- The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| |
Collapse
|
23
|
Abstract
Understanding how sequence-specific protein-DNA interactions direct cellular function is of great interest to the research community. High-throughput methods have been developed to determine DNA-binding specificities; one such technique, the bacterial one-hybrid (B1H) system, confers advantages including ease of use, sensitivity and throughput. In this review, we describe the evolution of the B1H system as a tool capable of screening large DNA libraries to investigate protein-DNA interactions of interest. We discuss how DNA-binding specificities produced by the B1H system have been used to predict regulatory targets. Additionally, we examine how this approach has been applied to characterize two common DNA-binding domain families-homeodomains and Cys2His2 zinc fingers-both in organism-wide studies and with synthetic approaches. In the case of the former, the B1H system has produced large catalogs of protein specificity and nuanced information about previously recovered DNA targets, thereby improving our understanding of these proteins' functions in vivo and increasing our capacity to predict similar interactions in other species. In the latter, synthetic screens of the same DNA-binding domains have further refined our models of specificity, through analyzing comprehensive libraries to uncover all proteins able to bind a complete set of targets, and, for instance, exploring how context-in the form of domain position within the parent protein-may affect specificity. Finally, we recognize the limitations of the B1H system and discuss its potential for use in the production of designer proteins and in studies of protein-protein interactions.
Collapse
|
24
|
Controlling gene networks and cell fate with precision-targeted DNA-binding proteins and small-molecule-based genome readers. Biochem J 2014; 462:397-413. [PMID: 25145439 DOI: 10.1042/bj20140400] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Transcription factors control the fate of a cell by regulating the expression of genes and regulatory networks. Recent successes in inducing pluripotency in terminally differentiated cells as well as directing differentiation with natural transcription factors has lent credence to the efforts that aim to direct cell fate with rationally designed transcription factors. Because DNA-binding factors are modular in design, they can be engineered to target specific genomic sequences and perform pre-programmed regulatory functions upon binding. Such precision-tailored factors can serve as molecular tools to reprogramme or differentiate cells in a targeted manner. Using different types of engineered DNA binders, both regulatory transcriptional controls of gene networks, as well as permanent alteration of genomic content, can be implemented to study cell fate decisions. In the present review, we describe the current state of the art in artificial transcription factor design and the exciting prospect of employing artificial DNA-binding factors to manipulate the transcriptional networks as well as epigenetic landscapes that govern cell fate.
Collapse
|
25
|
Siggers T, Reddy J, Barron B, Bulyk ML. Diversification of transcription factor paralogs via noncanonical modularity in C2H2 zinc finger DNA binding. Mol Cell 2014; 55:640-8. [PMID: 25042805 DOI: 10.1016/j.molcel.2014.06.019] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2014] [Revised: 05/27/2014] [Accepted: 06/09/2014] [Indexed: 12/25/2022]
Abstract
A major challenge in obtaining a full molecular description of evolutionary adaptation is to characterize how transcription factor (TF) DNA-binding specificity can change. To identify mechanisms of TF diversification, we performed detailed comparisons of yeast C2H2 ZF proteins with identical canonical recognition residues that are expected to bind the same DNA sequences. Unexpectedly, we found that ZF proteins can adapt to recognize new binding sites in a modular fashion whereby binding to common core sites remains unaffected. We identified two distinct mechanisms, conserved across multiple Ascomycota species, by which this molecular adaptation occurred. Our results suggest a route for TF evolution that alleviates negative pleiotropic effects by modularly gaining new binding sites. These findings expand our current understanding of ZF DNA binding and provide evidence for paralogous ZFs utilizing alternate modes of DNA binding to recognize unique sets of noncanonical binding sites.
Collapse
Affiliation(s)
- Trevor Siggers
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Department of Biology, Boston University, Boston, MA 02215, USA.
| | - Jessica Reddy
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Brian Barron
- Department of Biology, Boston University, Boston, MA 02215, USA
| | - Martha L Bulyk
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA; Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
26
|
Kabadi AM, Gersbach CA. Engineering synthetic TALE and CRISPR/Cas9 transcription factors for regulating gene expression. Methods 2014; 69:188-97. [PMID: 25010559 DOI: 10.1016/j.ymeth.2014.06.014] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2014] [Revised: 06/26/2014] [Accepted: 06/27/2014] [Indexed: 12/22/2022] Open
Abstract
Engineered DNA-binding proteins that can be targeted to specific sites in the genome to manipulate gene expression have enabled many advances in biomedical research. This includes generating tools to study fundamental aspects of gene regulation and the development of a new class of gene therapies that alter the expression of endogenous genes. Designed transcription factors have entered clinical trials for the treatment of human diseases and others are in preclinical development. High-throughput and user-friendly platforms for designing synthetic DNA-binding proteins present innovative methods for deciphering cell biology and designing custom synthetic gene circuits. We review two platforms for designing synthetic transcription factors for manipulating gene expression: Transcription activator-like effectors (TALEs) and the RNA-guided clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 system. We present an overview of each technology and a guide for designing and assembling custom TALE- and CRISPR/Cas9-based transcription factors. We also discuss characteristics of each platform that are best suited for different applications.
Collapse
Affiliation(s)
- Ami M Kabadi
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, United States
| | - Charles A Gersbach
- Department of Biomedical Engineering, Duke University, Durham, NC 27708, United States; Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708, United States; Department of Orthopaedic Surgery, Duke University Medical Center, Durham, NC 27710, United States.
| |
Collapse
|
27
|
The functional significance of common polymorphisms in zinc finger transcription factors. G3-GENES GENOMES GENETICS 2014; 4:1647-55. [PMID: 24970883 PMCID: PMC4169156 DOI: 10.1534/g3.114.012195] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Variants that alter the DNA-binding specificity of transcription factors could affect the specificity for and expression of potentially many target genes, as has been observed in several tumor-derived mutations. Here we examined if such trans expression quantitative trait loci (trans-eQTLs) could similarly result from common genetic variants. We chose to focus on the Cys2-His2 class of zinc finger transcription factors because they are the most abundant superfamily of transcription factors in human and have well-characterized DNA binding interactions. We identified 430 SNPs that cause missense substitutions in the DNA-contacting residues. Fewer common missense SNPs were found at DNA-contacting residues compared with non-DNA-contacting residues (P = 0.00006), consistent with possible functional selection against SNPs at DNA-contacting positions. Functional predictions based on zinc finger transcription factor (ZNF) DNA binding preferences also suggested that many common substitutions could potentially alter binding specificity. However, Hardy-Weinberg Equilibrium analysis and examination of seven orthologs within the primate lineage failed to find evidence of trans-eQTLs associated with the DNA-contacting positions or evidence of a different selection pressure on a contemporary and evolutionary timescales. The overall conclusion was that common SNPs that alter the DNA-contacting residues of these factors are unlikely to produce strong trans-eQTLs, consistent with the observations by others that trans-eQTLs in humans tend to be few and weak. Some rare SNPs might alter specificity and remained rare due to purifying selection. The study also underscores the need for large-scale eQTLs mapping efforts that might provide experimental evidence for SNPs that alter the choice of transcription factor binding sites.
Collapse
|
28
|
Gupta A, Christensen RG, Bell HA, Goodwin M, Patel RY, Pandey M, Enuameh MS, Rayla AL, Zhu C, Thibodeau-Beganny S, Brodsky MH, Joung JK, Wolfe SA, Stormo GD. An improved predictive recognition model for Cys(2)-His(2) zinc finger proteins. Nucleic Acids Res 2014; 42:4800-12. [PMID: 24523353 PMCID: PMC4005693 DOI: 10.1093/nar/gku132] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2013] [Revised: 01/21/2014] [Accepted: 01/22/2014] [Indexed: 11/17/2022] Open
Abstract
Cys(2)-His(2) zinc finger proteins (ZFPs) are the largest family of transcription factors in higher metazoans. They also represent the most diverse family with regards to the composition of their recognition sequences. Although there are a number of ZFPs with characterized DNA-binding preferences, the specificity of the vast majority of ZFPs is unknown and cannot be directly inferred by homology due to the diversity of recognition residues present within individual fingers. Given the large number of unique zinc fingers and assemblies present across eukaryotes, a comprehensive predictive recognition model that could accurately estimate the DNA-binding specificity of any ZFP based on its amino acid sequence would have great utility. Toward this goal, we have used the DNA-binding specificities of 678 two-finger modules from both natural and artificial sources to construct a random forest-based predictive model for ZFP recognition. We find that our recognition model outperforms previously described determinant-based recognition models for ZFPs, and can successfully estimate the specificity of naturally occurring ZFPs with previously defined specificities.
Collapse
Affiliation(s)
- Ankit Gupta
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Ryan G. Christensen
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Heather A. Bell
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Mathew Goodwin
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Ronak Y. Patel
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Manishi Pandey
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Metewo Selase Enuameh
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Amy L. Rayla
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Cong Zhu
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Stacey Thibodeau-Beganny
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Michael H. Brodsky
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - J. Keith Joung
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Scot A. Wolfe
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| | - Gary D. Stormo
- Program in Gene Function and Expression, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, MA 01605, USA, Department of Genetics, Washington University School of Medicine, St Louis, MO 63108, USA, Department of Biochemistry and Biology and Biotechnology, Worcester Polytechnic Institute, Worcester, MA 01609, USA, Molecular Pathology Unit, Center for Computational and Integrative Biology, and Center for Cancer Research, Massachusetts General Hospital, Charlestown, MA 02129, USA, Department of Molecular Medicine, University of Massachusetts Medical School, Worcester, MA 01605, USA and Department of Pathology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
29
|
Abstract
CRISPR/Cas systems act to protect the cell from invading nucleic acids in many bacteria and archaea. The bacterial immune protein Cas9 is a component of one of these CRISPR/Cas systems and has recently been adapted as a tool for genome editing. Cas9 is easily targeted to bind and cleave a DNA sequence via a complementary RNA; this straightforward programmability has gained Cas9 rapid acceptance in the field of genetic engineering. While this technology has developed quickly, a number of challenges regarding Cas9 specificity, efficiency, fusion protein function, and spatiotemporal control within the cell remain. In this work, we develop a platform for constructing novel proteins to address these open questions. We demonstrate methods to either screen or select active Cas9 mutants and use the screening technique to isolate functional Cas9 variants with a heterologous PDZ domain inserted within the protein. As a proof of concept, these methods lay the groundwork for the future construction of diverse Cas9 proteins. Straightforward and accessible techniques for genetic editing are helping to elucidate biology in new and exciting ways; a platform to engineer new functionalities into Cas9 will help forge the next generation of genome-modifying tools.
Collapse
Affiliation(s)
- Benjamin L Oakes
- Department of Molecular & Cell Biology, University of California, Berkeley, California, USA
| | - Dana C Nadler
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California, USA
| | - David F Savage
- Department of Molecular & Cell Biology, University of California, Berkeley, California, USA; Department of Chemistry, University of California, Berkeley, California, USA; Energy Biosciences Institute, University of California, Berkeley, California, USA.
| |
Collapse
|