1
|
Purtov YA, Ozoline ON. Neuromodulators as Interdomain Signaling Molecules Capable of Occupying Effector Binding Sites in Bacterial Transcription Factors. Int J Mol Sci 2023; 24:15863. [PMID: 37958845 PMCID: PMC10647483 DOI: 10.3390/ijms242115863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 10/29/2023] [Accepted: 10/30/2023] [Indexed: 11/15/2023] Open
Abstract
Hormones and neurotransmitters are important components of inter-kingdom signaling systems that ensure the coexistence of eukaryotes with their microbial community. Their ability to affect bacterial physiology, metabolism, and gene expression was evidenced by various experimental approaches, but direct penetration into bacteria has only recently been reported. This opened the possibility of considering neuromodulators as potential effectors of bacterial ligand-dependent regulatory proteins. Here, we assessed the validity of this assumption for the neurotransmitters epinephrine, dopamine, and norepinephrine and two hormones (melatonin and serotonin). Using flexible molecular docking for transcription factors with ligand-dependent activity, we assessed the ability of neuromodulators to occupy their effector binding sites. For many transcription factors, including the global regulator of carbohydrate metabolism, CRP, and the key regulator of lactose assimilation, LacI, this ability was predicted based on the analysis of several 3D models. By occupying the ligand binding site, neuromodulators can sterically hinder the interaction of the target proteins with the natural effectors or even replace them. The data obtained suggest that the direct modulation of the activity of at least some bacterial transcriptional factors by neuromodulators is possible. Therefore, the natural hormonal background may be a factor that preadapts bacteria to the habitat through direct perception of host signaling molecules.
Collapse
Affiliation(s)
- Yuri A. Purtov
- Department of Functional Genomics of Prokaryotes, Institute of Cell Biophysics of the Russian Academy of Sciences, Federal Research Center Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Pushchino 142290, Russia
| | - Olga N. Ozoline
- Department of Functional Genomics of Prokaryotes, Institute of Cell Biophysics of the Russian Academy of Sciences, Federal Research Center Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences, Pushchino 142290, Russia
| |
Collapse
|
2
|
Fu Y, Yeom SJ, Kwon KK, Hwang J, Kim H, Woo EJ, Lee DH, Lee SG. Structural and functional analyses of the cellulase transcription regulator CelR. FEBS Lett 2018; 592:2776-2785. [PMID: 30062758 DOI: 10.1002/1873-3468.13206] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2018] [Revised: 07/05/2018] [Accepted: 07/19/2018] [Indexed: 11/10/2022]
Abstract
CelR is a transcriptional regulator that controls the expression of cellulases catalyzing cellulose hydrolysis. However, the structural mechanism of its regulation has remained unclear. Here, we report the first structure of CelR, in this case with cellobiose bound. CelR consists of a DNA-binding domain (DBD) and a regulatory domain (RD), and homodimerizes with each monomer bound to cellobiose. A hinge region (HR) in CelR connects the DBD with the RD, and Leu59 in the HR acts as a 'leucine lever' that transduces a transcriptional activation signal. Furthermore, an α4 helix mediates the ligand-binding signal for transcriptional activation. Tyr84 and Gln301 can potentially alter the ligand specificity of CelR. This study provides a pivotal step toward understanding transcription of the cellulases.
Collapse
Affiliation(s)
- Yaoyao Fu
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,The Key Laboratory of Biotechnology for Medicinal Plant of Jiangsu Province, Jiangsu Normal University, China
| | - Soo-Jin Yeom
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea
| | - Kil Koang Kwon
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea
| | - Jungwon Hwang
- Infection and Immunity Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea
| | - Haseong Kim
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Biosystems and Bioengineering, KRIBB School of Biotechnology, University of Science and Technology (UST), Daejeon, Korea
| | - Eui-Jeon Woo
- Disease Target Structure Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Bio-Analytical Science, KRIBB School of Bioscience, University of Science and Technology (UST), Daejeon, Korea
| | - Dae-Hee Lee
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Biosystems and Bioengineering, KRIBB School of Biotechnology, University of Science and Technology (UST), Daejeon, Korea
| | - Seung-Goo Lee
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Biosystems and Bioengineering, KRIBB School of Biotechnology, University of Science and Technology (UST), Daejeon, Korea
| |
Collapse
|
3
|
Xu JS, Hewitt MN, Gulati JS, Cruz MA, Zhan H, Liu S, Matthews KS. Lactose repressor hinge domain independently binds DNA. Protein Sci 2018; 27:839-847. [PMID: 29318690 PMCID: PMC5866929 DOI: 10.1002/pro.3372] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Revised: 01/02/2018] [Accepted: 01/02/2018] [Indexed: 12/29/2022]
Abstract
The short 8-10 amino acid "hinge" sequence in lactose repressor (LacI), present in other LacI/GalR family members, links DNA and inducer-binding domains. Structural studies of full-length or truncated LacI-operator DNA complexes demonstrate insertion of the dimeric helical "hinge" structure at the center of the operator sequence. This association bends the DNA ∼40° and aligns flanking semi-symmetric DNA sites for optimal contact by the N-terminal helix-turn-helix (HtH) sequences within each dimer. In contrast, the hinge region remains unfolded when bound to nonspecific DNA sequences. To determine ability of the hinge helix alone to mediate DNA binding, we examined (i) binding of LacI variants with deletion of residues 1-50 to remove the HtH DNA binding domain or residues 1-58 to remove both HtH and hinge domains and (ii) binding of a synthetic peptide corresponding to the hinge sequence with a Val52Cys substitution that allows reversible dimer formation via a disulfide linkage. Binding affinity for DNA is orders of magnitude lower in the absence of the helix-turn-helix domain with its highly positive charge. LacI missing residues 1-50 binds to DNA with ∼4-fold greater affinity for operator than for nonspecific sequences with minimal impact of inducer presence; in contrast, LacI missing residues 1-58 exhibits no detectable affinity for DNA. In oxidized form, the dimeric hinge peptide alone binds to O1 and nonspecific DNA with similarly small difference in affinity; reduction to monomer diminished binding to both O1 and nonspecific targets. These results comport with recent reports regarding LacI hinge interaction with DNA sequences.
Collapse
Affiliation(s)
- Joseph S Xu
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Madeleine N Hewitt
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Jaskeerat S Gulati
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Matthew A Cruz
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Hongli Zhan
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Shirley Liu
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | | |
Collapse
|
4
|
Sousa FL, Parente DJ, Hessman JA, Chazelle A, Teichmann SA, Swint-Kruse L. Data on publications, structural analyses, and queries used to build and utilize the AlloRep database. Data Brief 2016; 8:948-57. [PMID: 27508249 PMCID: PMC4961497 DOI: 10.1016/j.dib.2016.07.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 06/22/2016] [Accepted: 07/04/2016] [Indexed: 01/08/2023] Open
Abstract
The AlloRep database (www.AlloRep.org) (Sousa et al., 2016) [1] compiles extensive sequence, mutagenesis, and structural information for the LacI/GalR family of transcription regulators. Sequence alignments are presented for >3000 proteins in 45 paralog subfamilies and as a subsampled alignment of the whole family. Phenotypic and biochemical data on almost 6000 mutants have been compiled from an exhaustive search of the literature; citations for these data are included herein. These data include information about oligomerization state, stability, DNA binding and allosteric regulation. Protein structural data for 65 proteins are presented as easily-accessible, residue-contact networks. Finally, this article includes example queries to enable the use of the AlloRep database. See the related article, “AlloRep: a repository of sequence, structural and mutagenesis data for the LacI/GalR transcription regulators” (Sousa et al., 2016) [1].
Collapse
Affiliation(s)
- Filipa L Sousa
- Institute of Molecular Evolution, Heinrich-Heine Universität Düsseldorf, Universitätstrasse 1, 40225 Düsseldorf, Germany
| | - Daniel J Parente
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Jacob A Hessman
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Allen Chazelle
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Sarah A Teichmann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK; Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Liskin Swint-Kruse
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| |
Collapse
|
5
|
Abstract
We review literature on the metabolism of ribo- and deoxyribonucleotides, nucleosides, and nucleobases in Escherichia coli and Salmonella,including biosynthesis, degradation, interconversion, and transport. Emphasis is placed on enzymology and regulation of the pathways, at both the level of gene expression and the control of enzyme activity. The paper begins with an overview of the reactions that form and break the N-glycosyl bond, which binds the nucleobase to the ribosyl moiety in nucleotides and nucleosides, and the enzymes involved in the interconversion of the different phosphorylated states of the nucleotides. Next, the de novo pathways for purine and pyrimidine nucleotide biosynthesis are discussed in detail.Finally, the conversion of nucleosides and nucleobases to nucleotides, i.e.,the salvage reactions, are described. The formation of deoxyribonucleotides is discussed, with emphasis on ribonucleotidereductase and pathways involved in fomation of dUMP. At the end, we discuss transport systems for nucleosides and nucleobases and also pathways for breakdown of the nucleobases.
Collapse
|
6
|
Racca JD, Chen YS, Maloy JD, Wickramasinghe N, Phillips NB, Weiss MA. Structure-function relationships in human testis-determining factor SRY: an aromatic buttress underlies the specific DNA-bending surface of a high mobility group (HMG) box. J Biol Chem 2014; 289:32410-29. [PMID: 25258310 DOI: 10.1074/jbc.m114.597526] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Human testis determination is initiated by SRY, a Y-encoded architectural transcription factor. Mutations in SRY cause 46 XY gonadal dysgenesis with female somatic phenotype (Swyer syndrome) and confer a high risk of malignancy (gonadoblastoma). Such mutations cluster in the SRY high mobility group (HMG) box, a conserved motif of specific DNA binding and bending. To explore structure-function relationships, we constructed all possible substitutions at a site of clinical mutation (W70L). Our studies thus focused on a core aromatic residue (position 15 of the consensus HMG box) that is invariant among SRY-related HMG box transcription factors (the SOX family) and conserved as aromatic (Phe or Tyr) among other sequence-specific boxes. In a yeast one-hybrid system sensitive to specific SRY-DNA binding, the variant domains exhibited reduced (Phe and Tyr) or absent activity (the remaining 17 substitutions). Representative nonpolar variants with partial or absent activity (Tyr, Phe, Leu, and Ala in order of decreasing side-chain volume) were chosen for study in vitro and in mammalian cell culture. The clinical mutation (Leu) was found to markedly impair multiple biochemical and cellular activities as respectively probed through the following: (i) in vitro assays of specific DNA binding and protein stability, and (ii) cell culture-based assays of proteosomal degradation, nuclear import, enhancer DNA occupancy, and SRY-dependent transcriptional activation. Surprisingly, however, DNA bending is robust to this or the related Ala substitution that profoundly impairs box stability. Together, our findings demonstrate that the folding, trafficking, and gene-regulatory function of SRY requires an invariant aromatic "buttress" beneath its specific DNA-bending surface.
Collapse
Affiliation(s)
- Joseph D Racca
- From the Department of Biochemistry, Case Western Reserve University, Cleveland, Ohio 44106
| | - Yen-Shan Chen
- From the Department of Biochemistry, Case Western Reserve University, Cleveland, Ohio 44106
| | - James D Maloy
- From the Department of Biochemistry, Case Western Reserve University, Cleveland, Ohio 44106
| | - Nalinda Wickramasinghe
- From the Department of Biochemistry, Case Western Reserve University, Cleveland, Ohio 44106
| | - Nelson B Phillips
- From the Department of Biochemistry, Case Western Reserve University, Cleveland, Ohio 44106
| | - Michael A Weiss
- From the Department of Biochemistry, Case Western Reserve University, Cleveland, Ohio 44106
| |
Collapse
|
7
|
Zhou C, Meysman P, Cule B, Laukens K, Goethals B. Discovery of Spatially Cohesive Itemsets in Three-Dimensional Protein Structures. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:814-825. [PMID: 26356855 DOI: 10.1109/tcbb.2014.2311795] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
In this paper we present a cohesive structural itemset miner aiming to discover interesting patterns in a set of data objects within a multidimensional spatial structure by combining the cohesion and the support of the pattern. We propose two ways to build the itemset miner, VertexOne and VertexAll, in an attempt to find a balance between accuracy and run-times. The experiments show that VertexOne performs better, and finds almost the same itemsets as VertexAll in a much shorter time. The usefulness of the method is demonstrated by applying it to find interesting patterns of amino acids in spatial proximity within a set of proteins based on their atomic coordinates in the protein molecular structure. Several patterns found by the cohesive structural itemset miner contain amino acids that frequently co-occur in the spatial structure, even if they are distant in the primary protein sequence and only brought together by protein folding. Further various indications were found that some of the discovered patterns seem to represent common underlying support structures within the proteins.
Collapse
|
8
|
Meinhardt S, Manley MW, Parente DJ, Swint-Kruse L. Rheostats and toggle switches for modulating protein function. PLoS One 2013; 8:e83502. [PMID: 24386217 PMCID: PMC3875437 DOI: 10.1371/journal.pone.0083502] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2013] [Accepted: 11/03/2013] [Indexed: 01/08/2023] Open
Abstract
The millions of protein sequences generated by genomics are expected to transform protein engineering and personalized medicine. To achieve these goals, tools for predicting outcomes of amino acid changes must be improved. Currently, advances are hampered by insufficient experimental data about nonconserved amino acid positions. Since the property “nonconserved” is identified using a sequence alignment, we designed experiments to recapitulate that context: Mutagenesis and functional characterization was carried out in 15 LacI/GalR homologs (rows) at 12 nonconserved positions (columns). Multiple substitutions were made at each position, to reveal how various amino acids of a nonconserved column were tolerated in each protein row. Results showed that amino acid preferences of nonconserved positions were highly context-dependent, had few correlations with physico-chemical similarities, and were not predictable from their occurrence in natural LacI/GalR sequences. Further, unlike the “toggle switch” behaviors of conserved positions, substitutions at nonconserved positions could be rank-ordered to show a “rheostatic”, progressive effect on function that spanned several orders of magnitude. Comparisons to various sequence analyses suggested that conserved and strongly co-evolving positions act as functional toggles, whereas other important, nonconserved positions serve as rheostats for modifying protein function. Both the presence of rheostat positions and the sequence analysis strategy appear to be generalizable to other protein families and should be considered when engineering protein modifications or predicting the impact of protein polymorphisms.
Collapse
Affiliation(s)
- Sarah Meinhardt
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Michael W. Manley
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Daniel J. Parente
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
- * E-mail:
| |
Collapse
|
9
|
Tungtur S, Parente DJ, Swint-Kruse L. Functionally important positions can comprise the majority of a protein's architecture. Proteins 2011; 79:1589-608. [PMID: 21374721 DOI: 10.1002/prot.22985] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Revised: 12/08/2010] [Accepted: 12/15/2010] [Indexed: 01/13/2023]
Abstract
Concomitant with the genomic era, many bioinformatics programs have been developed to identify functionally important positions from sequence alignments of protein families. To evaluate these analyses, many have used the LacI/GalR family and determined whether positions predicted to be "important" are validated by published experiments. However, we previously noted that predictions do not identify all of the experimentally important positions present in the linker regions of these homologs. In an attempt to reconcile these differences, we corrected and expanded the LacI/GalR sequence set commonly used in sequence/function analyses. Next, a variety of analyses were carried out (1) for the entire LacI/GalR sequence set and (2) for a subset of homologs with functionally-important "YxPxxxAxxL" motifs in their linkers. This strategy was devised to determine whether predictions could be improved by knowledge-based sequence sorting and-for some analyses-did increase the number of linker positions identified. However, two functionally important linker positions were not reliably identified by any analysis. Finally, we compared the new predictions to all known experimental data for E. coli LacI and three homologous linkers. From these, we estimate that >50% of positions are important to the functions of the LacI/GalR homologs. In corollary, neutral positions might occur less frequently and might be easier to detect in sequence analyses. Although analyses have successfully guided mutations that partially exchange protein functions, a better experimental understanding of the sequence/function relationships in protein families would be helpful for uncovering the remaining rules used by nature to evolve new protein functions.
Collapse
Affiliation(s)
- Sudheer Tungtur
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, MSN 3030, Kansas City, Kansas 66160, USA
| | | | | |
Collapse
|
10
|
Meysman P, Dang TH, Laukens K, De Smet R, Wu Y, Marchal K, Engelen K. Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli. Nucleic Acids Res 2010; 39:e6. [PMID: 21051340 PMCID: PMC3025552 DOI: 10.1093/nar/gkq1071] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Recognition of genomic binding sites by transcription factors can occur through base-specific recognition, or by recognition of variations within the structure of the DNA macromolecule. In this article, we investigate what information can be retrieved from local DNA structural properties that is relevant to transcription factor binding and that cannot be captured by the nucleotide sequence alone. More specifically, we explore the benefit of employing the structural characteristics of DNA to create binding-site models that encompass indirect recognition for the Escherichia coli model organism. We developed a novel methodology [Conditional Random fields of Smoothed Structural Data (CRoSSeD)], based on structural scales and conditional random fields to model and predict regulator binding sites. The value of relying on local structural-DNA properties is demonstrated by improved classifier performance on a large number of biological datasets, and by the detection of novel binding sites which could be validated by independent data sources, and which could not be identified using sequence data alone. We further show that the CRoSSeD-binding-site models can be related to the actual molecular mechanisms of the transcription factor DNA binding, and thus cannot only be used for prediction of novel sites, but might also give valuable insights into unknown binding mechanisms of transcription factors.
Collapse
Affiliation(s)
- Pieter Meysman
- Department of Microbial and Molecular systems, KU Leuven, Leuven Heverlee, Belgium
| | | | | | | | | | | | | |
Collapse
|
11
|
Comparing the functional roles of nonconserved sequence positions in homologous transcription repressors: implications for sequence/function analyses. J Mol Biol 2009; 395:785-802. [PMID: 19818797 DOI: 10.1016/j.jmb.2009.10.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2009] [Revised: 10/01/2009] [Accepted: 10/02/2009] [Indexed: 11/21/2022]
Abstract
The explosion of protein sequences deduced from genetic code has led to both a problem and a potential resource: Efficient data use requires interpreting the functional impact of sequence change without experimentally characterizing each protein variant. Several groups have hypothesized that interpretation could be aided by analyzing the sequences of naturally occurring homologues. To that end, myriad sequence/function analyses have been developed to predict which conserved, semi-conserved, and nonconserved positions are functionally important. These positions must be discriminated from the nonconserved positions that are functionally silent. However, the assumptions that underlie sequence analyses are based on experimental results that are sparse and usually designed to address different questions. Here, we use three homologues from a test family common to bioinformatics-the LacI/GalR transcription repressors-to test a common assumption: If a position is functionally important for one family member, it has similar importance in all homologues. We generated experimental sequence/function information for each nonconserved position in the 18 amino acids that link the DNA-binding and regulatory domains of three LacI/GalR homologues. We find that the functional importance of each position is preserved among the three linkers, albeit to different degrees. We also find that every linker position contributes to function, which has twofold implications. (1) Since the linker positions range from highly conserved to semi-conserved to nonconserved and contribute to affinity, selectivity, and allosteric response, we assert that sequence/function analyses must identify positions in the LacI/GalR linkers to be qualified as "successful". Many analyses overlook this region since most of the residues do not directly contact ligand. (2) No position in the LacI/GalR linker is functionally silent. This finding is inconsistent with another underlying principle of many analyses: Using sequence sets to discriminate important from non-contributing positions obligates silent positions, which denotes that most homologues tolerate a variety of amino acid substitutions at the position without functional change. Instead, additional combinatorial mutants in the LacI/GalR linkers show that particular substitutions can be silent in a context-dependent manner. Thus, specific permutations of sequence change (rather than change at silent positions) would facilitate neutral drift during evolution. Finally, the combinatorial mutants also reveal functional synergy between semi- and nonconserved positions. Such functional relationships would be missed by analyses that rely primarily upon co-evolution.
Collapse
|
12
|
Jamal Rahi S, Virnau P, Mirny LA, Kardar M. Predicting transcription factor specificity with all-atom models. Nucleic Acids Res 2008; 36:6209-17. [PMID: 18829719 PMCID: PMC2577325 DOI: 10.1093/nar/gkn589] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
The binding of a transcription factor (TF) to a DNA operator site can initiate or repress the expression of a gene. Computational prediction of sites recognized by a TF has traditionally relied upon knowledge of several cognate sites, rather than an ab initio approach. Here, we examine the possibility of using structure-based energy calculations that require no knowledge of bound sites but rather start with the structure of a protein–DNA complex. We study the PurR Escherichia coli TF, and explore to which extent atomistic models of protein–DNA complexes can be used to distinguish between cognate and noncognate DNA sites. Particular emphasis is placed on systematic evaluation of this approach by comparing its performance with bioinformatic methods, by testing it against random decoys and sites of homologous TFs. We also examine a set of experimental mutations in both DNA and the protein. Using our explicit estimates of energy, we show that the specificity for PurR is dominated by direct protein–DNA interactions, and weakly influenced by bending of DNA.
Collapse
Affiliation(s)
- Sahand Jamal Rahi
- Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA, Staudinger Weg 7, Institut für Physik, 55099 Mainz, Germany and Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Peter Virnau
- Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA, Staudinger Weg 7, Institut für Physik, 55099 Mainz, Germany and Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
- *To whom correspondence should be addressed. Tel: +49 6131 392 3646; Fax: +49 6131 392 5441;
| | - Leonid A. Mirny
- Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA, Staudinger Weg 7, Institut für Physik, 55099 Mainz, Germany and Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Mehran Kardar
- Department of Physics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA, Staudinger Weg 7, Institut für Physik, 55099 Mainz, Germany and Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| |
Collapse
|
13
|
Devroede N, Thia-Toong TL, Gigot D, Maes D, Charlier D. Purine and pyrimidine-specific repression of the Escherichia coli carAB operon are functionally and structurally coupled. J Mol Biol 2004; 336:25-42. [PMID: 14741201 DOI: 10.1016/j.jmb.2003.12.024] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Transcription of the carAB operon encoding the sole carbamoylphosphate synthetase of Escherichia coli proceeds from a tandem pair of promoters. P2, downstream, is repressed by arginine and the ArgR protein, whereas P1 is submitted to pyrimidine-specific regulation and as shown here to purine-specific control exerted by binding of the PurR protein to a PUR box sequence centered around nucleotide -128.5 with respect to the start of P1 transcription. In vivo analyses of the effects of trans and cis-acting mutations on the regulatory responses and single round in vitro transcription assays indicated that ligand-bound PurR is by itself unable to inhibit P1 promoter activity. To exert its effect PurR relies on the elaborated nucleoprotein complex that governs P1 activity in a pyrimidine-specific manner. Thus we reveal the existence of an unprecedented functional and structural coupling between the modulation of P1 activity by purine and pyrimidine residues that appears to result from the unique position of the PUR box in the carAB control region, far upstream of the promoter. Missing contact and premethylation binding interference studies revealed the importance of base-specific groups and of structural aspects of the PUR box sequence in complex formation. Permutation assays indicated that the overall PurR-induced bending of the carAB control region is slightly less pronounced than that of the purF operator. The PUR boxes of the carAB operon of E.coli and Salmonella typhimurium are unique in that they have a guanine residue at position eight. Interestingly, guanine at this position has been proposed to be extremely unfavorable on the basis of modeling and binding studies, as its exocyclic amino group would enter into a steric clash with the side-chain of lysine 55. To analyze the effect of guanine at position eight in the upstream half-site of the carAB operator we constructed the adenine derivative and assayed in vivo repressibility of P1 promoter activity and in vitroPurR binding to the mutant operator, and constructed a molecular model for the unusual lysine 55-guanine 8 interaction.
Collapse
Affiliation(s)
- Neel Devroede
- Erfelijkheidsleer en Microbiologie, Vrije Universiteit Brussel (VUB), Pleinlaan 2, B-1050 Brussels, Belgium
| | | | | | | | | |
Collapse
|
14
|
Swint-Kruse L, Larson C, Pettitt BM, Matthews KS. Fine-tuning function: correlation of hinge domain interactions with functional distinctions between LacI and PurR. Protein Sci 2002; 11:778-94. [PMID: 11910022 PMCID: PMC2373529 DOI: 10.1110/ps.4050102] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
LacI and PurR are highly homologous proteins. Their functional units are homodimers, with an N-terminal DNA binding domain that comprises the helix-turn-helix (HTH), N-linker, and hinge regions from both monomers. Hinge structural changes are known to occur upon DNA dissociation but are difficult to monitor experimentally. The initial steps of hinge unfolding were therefore examined using molecular dynamics simulations, utilizing a truncated, chimeric protein comprising the LacI HTH/N-linker and PurR hinge. A terminal Gly-Cys-Gly was added to allow "dimerization" through disulfide bond formation. Simulations indicate that differences in LacI and PurR hinge primary sequence affect the quaternary structure of the hinge x hinge' interface. However, these alternate hinge orientations would be sterically restricted by the core domain. These results prompted detailed comparison of recently available DNA-bound structures for LacI and truncated LacI(1-62) with the PurR structure. Examination revealed that different N-linker and hinge contacts to the core domain of the partner monomer (which binds effector molecule) affect the juxtapositions of the HTH, N-linker, and hinge regions in the DNA binding domain. In addition, the two full-length repressors exhibit significant differences in the interactions between the core and the C-linker connection to the DNA binding domain. Both linkers and the hinge have been implicated in the allosteric response of these repressors. Intriguingly, one functional difference between these two proteins is that they exhibit opposite allosteric response to effector. Simulations and observed structural distinctions are correlated with mutational analysis and sequence information from the LacI/GalR family to formulate a mechanism for fine-tuning individual repressor function.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas 77005, USA.
| | | | | | | |
Collapse
|
15
|
Pabo CO, Nekludova L. Geometric analysis and comparison of protein-DNA interfaces: why is there no simple code for recognition? J Mol Biol 2000; 301:597-624. [PMID: 10966773 DOI: 10.1006/jmbi.2000.3918] [Citation(s) in RCA: 198] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Structural studies of protein-DNA complexes have shown that there are many distinct families of DNA-binding proteins, and have shown that there is no simple "code" describing side-chain/base interactions. However, systematic analysis and comparison of protein-DNA complexes has been complicated by the diversity of observed contacts, the sheer number of complexes currently available and the absence of any consistent method of comparison that retains detailed structural information about the protein-DNA interface. To address these problems, we have developed geometric methods for characterizing the local structural environment in which particular side-chain/base interactions are observed. In particular, we develop methods for analyzing and comparing spatial relationships at the protein-DNA interface. Our method involves attaching local coordinate systems to the DNA bases and to the C(alpha) atoms of the peptide backbone (these are relatively rigid structural units). We use these tools to consider how the position and orientation of the polypeptide backbone (with respect to the DNA) helps to determine what contacts are possible at any given position in a protein-DNA complex. Here, we focus on base contacts that are made in the major groove, and we use spatial relationships in analyzing: (i) the observed patterns of side-chain/base interactions; (ii) observed helix docking orientations; (iii) family/subfamily relationships among DNA-binding proteins; and (iv) broader questions about evolution, altered specificity mutants and the limits for the design of new DNA-binding proteins. Our analysis, which highlights differences in spatial relationships in different complexes and at different positions in a complex, helps explain why there is no simple, general code for protein-DNA recognition.
Collapse
Affiliation(s)
- C O Pabo
- Howard Hughes Medical Institute, Department of Biology 68-580, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. pabo@,it.edu
| | | |
Collapse
|
16
|
Spronk CA, Bonvin AM, Radha PK, Melacini G, Boelens R, Kaptein R. The solution structure of Lac repressor headpiece 62 complexed to a symmetrical lac operator. Structure 1999; 7:1483-92. [PMID: 10647179 DOI: 10.1016/s0969-2126(00)88339-2] [Citation(s) in RCA: 72] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Abstract
BACKGROUND Lactose repressor protein (Lac) controls the expression of the lactose metabolic genes in Escherichia coli by binding to an operator sequence in the promoter of the lac operon. Binding of inducer molecules to the Lac core domain induces changes in tertiary structure that are propagated to the DNA-binding domain through the connecting hinge region, thereby reducing the affinity for the operator. Protein-protein and protein-DNA interactions involving the hinge region play a crucial role in the allosteric changes occurring upon induction, but have not, as yet, been analyzed in atomic detail. RESULTS We have used nuclear magnetic resonance (NMR) spectroscopy and restrained molecular dynamics (rMD) to determine the structure of the Lac repressor DNA-binding domain (headpeice 62; HP62) in complex with a symmetrized lac operator. Analysis of the structures reveals specific interactions between Lac repressor and DNA that were not found in previously investigated Lac repressor-DNA complexes. Important differences with the previously reported structures of the HP56-DNA complex were found in the loop following the helix-turn-helix (HTH) motif. The protein-protein and protein-DNA interactions involving the hinge region and the deformations in the DNA structure could be delineated in atomic detail. The structures were also used for comparison with the available crystallographic data on the Lac and Pur repressor-DNA complexes. CONCLUSIONS The structures of the HP62-DNA complex provide the basis for a better understanding of the specific recognition in the Lac repressor-operator complex. In addition, the structural features of the hinge region provide detailed insight into the protein-protein and protein-DNA interactions responsible for the high affinity of the repressor for operator DNA.
Collapse
Affiliation(s)
- C A Spronk
- Bijvoet Center for Biomolecular Research, Utrecht University, The Netherlands
| | | | | | | | | | | |
Collapse
|
17
|
Glasfeld A, Koehler AN, Schumacher MA, Brennan RG. The role of lysine 55 in determining the specificity of the purine repressor for its operators through minor groove interactions. J Mol Biol 1999; 291:347-61. [PMID: 10438625 DOI: 10.1006/jmbi.1999.2946] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The interaction of the dimeric Escherichia coli purine repressor (PurR) with its cognate sequences leads to a 45 degrees to 50 degrees kink at a central CpG base step towards the major groove, as dyad-related leucine side-chains interdigitate between these bases from the minor groove. The resulting broadening of the minor groove increases the accessibility of the six central base-pairs towards minor groove interactions with residues from PurR. It has been shown that lysine 55 of PurR makes a direct contact with the adenine base (Ade8) directly 5' to the central CpG base-pair step in the high-affinity purF operator sequence. We have investigated the importance of this interaction in the specificity and affinity of wild-type PurR (WT) for its operators and we have studied a mutant of PurR in which Lys55 is replaced with alanine (K55A). Complexes of WT and K55A with duplex DNA containing pur operator sequences varied at position 8 were investigated crystallographically, and binding studies were performed using fluorescence anisotropy. The structures of the protein-DNA complexes reveal a relatively unperturbed global conformation regardless of the identity of the base-pair at position 8 or residue 55. In all structures the combination of higher resolution and a palindromic purF operator site allowed several new PurR.DNA interactions to be observed, including contacts by Thr15, Thr16 and His20. The side-chain of Lys55 makes productive, though varying, interactions with the adenine, thymine or cytosine base at position 8 that result in equilibrium dissociation constants of 2.6 nM, 10 nM and 35 nM, respectively. However, the bulk of the lysine side-chain apparently blocks high-affinity binding of operators with guanine at position 8 (Kd620 nM). Also, the high-affinity binding conformation appears blocked, as crystals of WT bound to DNA with guanine at position 8 could not be grown. In complexes containing K55A, the alanine side-chain is too far removed to engage in van der Waals interactions with the operator, and, with the loss of the general electrostatic interaction between the phosphate backbone and the ammonium group of lysine, K55A binds each operator weakly. However, the mutation leads to a swap of specificity of PurR for the base at position 8, with K55A exhibiting a twofold preference for guanine over adenine. In addition to defining the role of Lys55 in PurR minor groove binding, these studies provide structural insight into the minor groove binding specificities of other LacI/GalR family members that have either alanine (e.g. LacI, GalR, CcpA) or a basic residue (e.g. RafR, ScrR, RbtR) at the comparable position.
Collapse
Affiliation(s)
- A Glasfeld
- Department of Biochemistry and Molecular Biology, Oregon Health Sciences University, Portland, OR, 97201-3098, USA
| | | | | | | |
Collapse
|
18
|
Abstract
Growing interest in understanding the relationship between the global folding of nucleic acids and the sequence-dependent structure of individual base-pair steps has stimulated the development of new mathematical methods to define the geometry of the constituent base-pairs. Several approaches, designed to meet guidelines set by the nucleic acid community, permit rigorous comparative analyses of different three-dimensional structures, as well as allow for reconstruction of chain molecules at the base-pair level. The different computer programs, however, yield inconsistent descriptions of chain conformation. Here we report our own implementation of seven algorithms used to determine base-pair and dimer step parameters. Aside from reproducing the results of individual programs, we uncover the reasons why the different algorithms come to conflicting structural interpretations. The choice of mathematics has only a limited effect on the computed parameters, even in highly deformed duplexes. The results are much more sensitive to the choice of reference frame. The disparate schemes yield very similar conformational descriptions if the calculations are based on a common reference frame. The current positioning of reference frames at the inner and outer edges of complementary bases exaggerates the rise at distorted dimer steps, and points to the need for a carefully defined conformational standard.
Collapse
Affiliation(s)
- X J Lu
- Department of Chemistry, Rutgers, the State University of New Jersey, Wright-Rieman Laboratories, 610 Taylor Road, Piscataway, NJ, 08854-8087, USA
| | | |
Collapse
|