1
|
Morea V, Angelucci F, Bellelli A. Is allostery a fuzzy concept? FEBS Open Bio 2024; 14:1040-1056. [PMID: 38783588 PMCID: PMC11216940 DOI: 10.1002/2211-5463.13794] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/30/2024] [Accepted: 03/11/2024] [Indexed: 05/25/2024] Open
Abstract
Allostery is an important property of biological macromolecules which regulates diverse biological functions such as catalysis, signal transduction, transport, and molecular recognition. However, the concept was expressed using two different definitions by J. Monod and, over time, more have been added by different authors, making it fuzzy. Here, we reviewed the different meanings of allostery in the current literature and found that it has been used to indicate that the function of a protein is regulated by heterotropic ligands, and/or that the binding of ligands and substrates presents homotropic positive or negative cooperativity, whatever the hypothesized or demonstrated reaction mechanism might be. Thus, proteins defined to be allosteric include not only those that obey the two-state concerted model, but also those that obey different reaction mechanisms such as ligand-induced fit, possibly coupled to sequential structure changes, and ligand-linked dissociation-association. Since each reaction mechanism requires its own mathematical description and is defined by it, there are many possible 'allosteries'. This lack of clarity is made even fuzzier by the fact that the reaction mechanism is often assigned imprecisely and/or implicitly in the absence of the necessary experimental evidence. In this review, we examine a list of proteins that have been defined to be allosteric and attempt to assign a reaction mechanism to as many as possible.
Collapse
Affiliation(s)
- Veronica Morea
- Institute of Molecular Biology and Pathology, CNRRomeItaly
| | - Francesco Angelucci
- Department of Life, Health, and Environmental SciencesUniversity of L'AquilaItaly
| | - Andrea Bellelli
- Department of Biochemical Sciences “A. Rossi Fanelli”Sapienza University of RomeItaly
| |
Collapse
|
2
|
Swint-Kruse L, Fenton AW. Rheostats, toggles, and neutrals, Oh my! A new framework for understanding how amino acid changes modulate protein function. J Biol Chem 2024; 300:105736. [PMID: 38336297 PMCID: PMC10914490 DOI: 10.1016/j.jbc.2024.105736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/09/2024] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open
Abstract
Advances in personalized medicine and protein engineering require accurately predicting outcomes of amino acid substitutions. Many algorithms correctly predict that evolutionarily-conserved positions show "toggle" substitution phenotypes, which is defined when a few substitutions at that position retain function. In contrast, predictions often fail for substitutions at the less-studied "rheostat" positions, which are defined when different amino acid substitutions at a position sample at least half of the possible functional range. This review describes efforts to understand the impact and significance of rheostat positions: (1) They have been observed in globular soluble, integral membrane, and intrinsically disordered proteins; within single proteins, their prevalence can be up to 40%. (2) Substitutions at rheostat positions can have biological consequences and ∼10% of substitutions gain function. (3) Although both rheostat and "neutral" (defined when all substitutions exhibit wild-type function) positions are nonconserved, the two classes have different evolutionary signatures. (4) Some rheostat positions have pleiotropic effects on function, simultaneously modulating multiple parameters (e.g., altering both affinity and allosteric coupling). (5) In structural studies, substitutions at rheostat positions appear to cause only local perturbations; the overall conformations appear unchanged. (6) Measured functional changes show promising correlations with predicted changes in protein dynamics; the emergent properties of predicted, dynamically coupled amino acid networks might explain some of the complex functional outcomes observed when substituting rheostat positions. Overall, rheostat positions provide unique opportunities for using single substitutions to tune protein function. Future studies of these positions will yield important insights into the protein sequence/function relationship.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
3
|
Matilla MA, Velando F, Martín-Mora D, Monteagudo-Cascales E, Krell T. A catalogue of signal molecules that interact with sensor kinases, chemoreceptors and transcriptional regulators. FEMS Microbiol Rev 2021; 46:6356564. [PMID: 34424339 DOI: 10.1093/femsre/fuab043] [Citation(s) in RCA: 57] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 08/10/2021] [Indexed: 12/12/2022] Open
Abstract
Bacteria have evolved many different signal transduction systems that sense signals and generate a variety of responses. Generally, most abundant are transcriptional regulators, sensor histidine kinases and chemoreceptors. Typically, these systems recognize their signal molecules with dedicated ligand-binding domains (LBDs), which, in turn, generate a molecular stimulus that modulates the activity of the output module. There are an enormous number of different LBDs that recognize a similarly diverse set of signals. To give a global perspective of the signals that interact with transcriptional regulators, sensor kinases and chemoreceptors, we manually retrieved information on the protein-ligand interaction from about 1,200 publications and 3D structures. The resulting 811 proteins were classified according to the Pfam family into 127 groups. These data permit a delineation of the signal profiles of individual LBD families as well as distinguishing between families that recognize signals in a promiscuous manner and those that possess a well-defined ligand range. A major bottleneck in the field is the fact that the signal input of many signaling systems is unknown. The signal repertoire reported here will help the scientific community design experimental strategies to identify the signaling molecules for uncharacterised sensor proteins.
Collapse
Affiliation(s)
- Miguel A Matilla
- Department of Environmental Protection, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Prof. Albareda 1, 18008 Granada, Spain
| | - Félix Velando
- Department of Environmental Protection, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Prof. Albareda 1, 18008 Granada, Spain
| | - David Martín-Mora
- Department of Environmental Protection, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Prof. Albareda 1, 18008 Granada, Spain
| | - Elizabet Monteagudo-Cascales
- Department of Environmental Protection, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Prof. Albareda 1, 18008 Granada, Spain
| | - Tino Krell
- Department of Environmental Protection, Estación Experimental del Zaidín, Consejo Superior de Investigaciones Científicas, Prof. Albareda 1, 18008 Granada, Spain
| |
Collapse
|
4
|
Kinnersley M, Schwartz K, Yang DD, Sherlock G, Rosenzweig F. Evolutionary dynamics and structural consequences of de novo beneficial mutations and mutant lineages arising in a constant environment. BMC Biol 2021; 19:20. [PMID: 33541358 PMCID: PMC7863352 DOI: 10.1186/s12915-021-00954-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Accepted: 01/08/2021] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Microbial evolution experiments can be used to study the tempo and dynamics of evolutionary change in asexual populations, founded from single clones and growing into large populations with multiple clonal lineages. High-throughput sequencing can be used to catalog de novo mutations as potential targets of selection, determine in which lineages they arise, and track the fates of those lineages. Here, we describe a long-term experimental evolution study to identify targets of selection and to determine when, where, and how often those targets are hit. RESULTS We experimentally evolved replicate Escherichia coli populations that originated from a mutator/nonsense suppressor ancestor under glucose limitation for between 300 and 500 generations. Whole-genome, whole-population sequencing enabled us to catalog 3346 de novo mutations that reached > 1% frequency. We sequenced the genomes of 96 clones from each population when allelic diversity was greatest in order to establish whether mutations were in the same or different lineages and to depict lineage dynamics. Operon-specific mutations that enhance glucose uptake were the first to rise to high frequency, followed by global regulatory mutations. Mutations related to energy conservation, membrane biogenesis, and mitigating the impact of nonsense mutations, both ancestral and derived, arose later. New alleles were confined to relatively few loci, with many instances of identical mutations arising independently in multiple lineages, among and within replicate populations. However, most never exceeded 10% in frequency and were at a lower frequency at the end of the experiment than at their maxima, indicating clonal interference. Many alleles mapped to key structures within the proteins that they mutated, providing insight into their functional consequences. CONCLUSIONS Overall, we find that when mutational input is increased by an ancestral defect in DNA repair, the spectrum of high-frequency beneficial mutations in a simple, constant resource-limited environment is narrow, resulting in extreme parallelism where many adaptive mutations arise but few ever go to fixation.
Collapse
Affiliation(s)
- Margie Kinnersley
- Division of Biological Sciences, The University of Montana, Missoula, MT, 59812, USA
| | - Katja Schwartz
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305-5120, USA
| | - Dong-Dong Yang
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332, USA
| | - Gavin Sherlock
- Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305-5120, USA.
| | - Frank Rosenzweig
- Division of Biological Sciences, The University of Montana, Missoula, MT, 59812, USA.
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, 30332, USA.
| |
Collapse
|
5
|
Ye F, Wang C, Fu Q, Yan XF, Bharath SR, Casanas A, Wang M, Song H, Zhang LH, Gao YG. Structural basis of a novel repressor, SghR, controlling Agrobacterium infection by cross-talking to plants. J Biol Chem 2020; 295:12290-12304. [PMID: 32651231 PMCID: PMC7443487 DOI: 10.1074/jbc.ra120.012908] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2020] [Revised: 07/03/2020] [Indexed: 11/06/2022] Open
Abstract
Agrobacterium tumefaciens infects various plants and causes crown gall diseases involving temporal expression of virulence factors. SghA is a newly identified virulence factor enzymatically releasing salicylic acid from its glucoside conjugate and controlling plant tumor development. Here, we report the structural basis of SghR, a LacI-type transcription factor highly conserved in Rhizobiaceae family, regulating the expression of SghA and involved in tumorigenesis. We identified and characterized the binding site of SghR on the promoter region of sghA and then determined the crystal structures of apo-SghR, SghR complexed with its operator DNA, and ligand sucrose, respectively. These results provide detailed insights into how SghR recognizes its cognate DNA and shed a mechanistic light on how sucrose attenuates the affinity of SghR with DNA to modulate the expression of SghA. Given the important role of SghR in mediating the signaling cross-talk during Agrobacterium infection, our results pave the way for structure-based inducer analog design, which has potential applications for agricultural industry.
Collapse
Affiliation(s)
- Fuzhou Ye
- School of Biological Sciences, Nanyang Technological University, Singapore
| | - Chao Wang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, China
| | - Qinqin Fu
- School of Biological Sciences, Nanyang Technological University, Singapore
| | - Xin-Fu Yan
- School of Biological Sciences, Nanyang Technological University, Singapore
| | | | - Arnau Casanas
- Swiss Light Source at Paul Scherrer Institut, Villigen, Switzerland
| | - Meitian Wang
- Swiss Light Source at Paul Scherrer Institut, Villigen, Switzerland
| | - Haiwei Song
- Institute of Molecular and Cell Biology, Singapore
| | - Lian-Hui Zhang
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, China
- Institute of Molecular and Cell Biology, Singapore
| | - Yong-Gui Gao
- School of Biological Sciences, Nanyang Technological University, Singapore
- Institute of Molecular and Cell Biology, Singapore
| |
Collapse
|
6
|
Charlier D, Nguyen Le Minh P, Roovers M. Regulation of carbamoylphosphate synthesis in Escherichia coli: an amazing metabolite at the crossroad of arginine and pyrimidine biosynthesis. Amino Acids 2018; 50:1647-1661. [PMID: 30238253 PMCID: PMC6245113 DOI: 10.1007/s00726-018-2654-z] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Accepted: 09/11/2018] [Indexed: 12/17/2022]
Abstract
In all organisms, carbamoylphosphate (CP) is a precursor common to the synthesis of arginine and pyrimidines. In Escherichia coli and most other Gram-negative bacteria, CP is produced by a single enzyme, carbamoylphosphate synthase (CPSase), encoded by the carAB operon. This particular situation poses a question of basic physiological interest: what are the metabolic controls coordinating the synthesis and distribution of this high-energy substance in view of the needs of both pathways? The study of the mechanisms has revealed unexpected moonlighting gene regulatory activities of enzymes and functional links between mechanisms as diverse as gene regulation and site-specific DNA recombination. At the level of enzyme production, various regulatory mechanisms were found to cooperate in a particularly intricate transcriptional control of a pair of tandem promoters. Transcription initiation is modulated by an interplay of several allosteric DNA-binding transcription factors using effector molecules from three different pathways (arginine, pyrimidines, purines), nucleoid-associated factors (NAPs), trigger enzymes (enzymes with a second unlinked gene regulatory function), DNA remodeling (bending and wrapping), UTP-dependent reiterative transcription initiation, and stringent control by the alarmone ppGpp. At the enzyme level, CPSase activity is tightly controlled by allosteric effectors originating from different pathways: an inhibitor (UMP) and two activators (ornithine and IMP) that antagonize the inhibitory effect of UMP. Furthermore, it is worth noticing that all reaction intermediates in the production of CP are extremely reactive and unstable, and protected by tunneling through a 96 Å long internal channel.
Collapse
Affiliation(s)
- Daniel Charlier
- Research Group of Microbiology, Department of Bio-engineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050, Brussels, Belgium.
| | - Phu Nguyen Le Minh
- Research Group of Microbiology, Department of Bio-engineering Sciences, Vrije Universiteit Brussel, Pleinlaan 2, 1050, Brussels, Belgium
| | - Martine Roovers
- LABIRIS Institut de Recherches, Av. Emile Gryson 1, 1070, Brussels, Belgium
| |
Collapse
|
7
|
Fu Y, Yeom SJ, Kwon KK, Hwang J, Kim H, Woo EJ, Lee DH, Lee SG. Structural and functional analyses of the cellulase transcription regulator CelR. FEBS Lett 2018; 592:2776-2785. [PMID: 30062758 DOI: 10.1002/1873-3468.13206] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2018] [Revised: 07/05/2018] [Accepted: 07/19/2018] [Indexed: 11/10/2022]
Abstract
CelR is a transcriptional regulator that controls the expression of cellulases catalyzing cellulose hydrolysis. However, the structural mechanism of its regulation has remained unclear. Here, we report the first structure of CelR, in this case with cellobiose bound. CelR consists of a DNA-binding domain (DBD) and a regulatory domain (RD), and homodimerizes with each monomer bound to cellobiose. A hinge region (HR) in CelR connects the DBD with the RD, and Leu59 in the HR acts as a 'leucine lever' that transduces a transcriptional activation signal. Furthermore, an α4 helix mediates the ligand-binding signal for transcriptional activation. Tyr84 and Gln301 can potentially alter the ligand specificity of CelR. This study provides a pivotal step toward understanding transcription of the cellulases.
Collapse
Affiliation(s)
- Yaoyao Fu
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,The Key Laboratory of Biotechnology for Medicinal Plant of Jiangsu Province, Jiangsu Normal University, China
| | - Soo-Jin Yeom
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea
| | - Kil Koang Kwon
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea
| | - Jungwon Hwang
- Infection and Immunity Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea
| | - Haseong Kim
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Biosystems and Bioengineering, KRIBB School of Biotechnology, University of Science and Technology (UST), Daejeon, Korea
| | - Eui-Jeon Woo
- Disease Target Structure Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Bio-Analytical Science, KRIBB School of Bioscience, University of Science and Technology (UST), Daejeon, Korea
| | - Dae-Hee Lee
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Biosystems and Bioengineering, KRIBB School of Biotechnology, University of Science and Technology (UST), Daejeon, Korea
| | - Seung-Goo Lee
- Synthetic Biology and Bioengineering Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, Korea.,Department of Biosystems and Bioengineering, KRIBB School of Biotechnology, University of Science and Technology (UST), Daejeon, Korea
| |
Collapse
|
8
|
Xu JS, Hewitt MN, Gulati JS, Cruz MA, Zhan H, Liu S, Matthews KS. Lactose repressor hinge domain independently binds DNA. Protein Sci 2018; 27:839-847. [PMID: 29318690 PMCID: PMC5866929 DOI: 10.1002/pro.3372] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2017] [Revised: 01/02/2018] [Accepted: 01/02/2018] [Indexed: 12/29/2022]
Abstract
The short 8-10 amino acid "hinge" sequence in lactose repressor (LacI), present in other LacI/GalR family members, links DNA and inducer-binding domains. Structural studies of full-length or truncated LacI-operator DNA complexes demonstrate insertion of the dimeric helical "hinge" structure at the center of the operator sequence. This association bends the DNA ∼40° and aligns flanking semi-symmetric DNA sites for optimal contact by the N-terminal helix-turn-helix (HtH) sequences within each dimer. In contrast, the hinge region remains unfolded when bound to nonspecific DNA sequences. To determine ability of the hinge helix alone to mediate DNA binding, we examined (i) binding of LacI variants with deletion of residues 1-50 to remove the HtH DNA binding domain or residues 1-58 to remove both HtH and hinge domains and (ii) binding of a synthetic peptide corresponding to the hinge sequence with a Val52Cys substitution that allows reversible dimer formation via a disulfide linkage. Binding affinity for DNA is orders of magnitude lower in the absence of the helix-turn-helix domain with its highly positive charge. LacI missing residues 1-50 binds to DNA with ∼4-fold greater affinity for operator than for nonspecific sequences with minimal impact of inducer presence; in contrast, LacI missing residues 1-58 exhibits no detectable affinity for DNA. In oxidized form, the dimeric hinge peptide alone binds to O1 and nonspecific DNA with similarly small difference in affinity; reduction to monomer diminished binding to both O1 and nonspecific targets. These results comport with recent reports regarding LacI hinge interaction with DNA sequences.
Collapse
Affiliation(s)
- Joseph S Xu
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Madeleine N Hewitt
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Jaskeerat S Gulati
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Matthew A Cruz
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Hongli Zhan
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | - Shirley Liu
- Department of BioSciences, MS-140, Rice University, Houston, Texas, 77251
| | | |
Collapse
|
9
|
Swint-Kruse L. Using Evolution to Guide Protein Engineering: The Devil IS in the Details. Biophys J 2017; 111:10-8. [PMID: 27410729 DOI: 10.1016/j.bpj.2016.05.030] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Revised: 04/18/2016] [Accepted: 05/20/2016] [Indexed: 10/21/2022] Open
Abstract
For decades, protein engineers have endeavored to reengineer existing proteins for novel applications. Overall, protein folds and gross functions can be readily transferred from one protein to another by transplanting large blocks of sequence (i.e., domain recombination). However, predictably fine-tuning function (e.g., by adjusting ligand affinity, specificity, catalysis, and/or allosteric regulation) remains a challenge. One approach has been to use the sequences of protein families to identify amino acid positions that change during the evolution of functional variation. The rationale is that these nonconserved positions could be mutated to predictably fine-tune function. Evolutionary approaches to protein design have had some success, but the engineered proteins seldom replicate the functional performances of natural proteins. This Biophysical Perspective reviews several complexities that have been revealed by evolutionary and experimental studies of protein function. These include 1) challenges in defining computational and biological thresholds that define important amino acids; 2) the co-occurrence of many different patterns of amino acid changes in evolutionary data; 3) difficulties in mapping the patterns of amino acid changes to discrete functional parameters; 4) the nonconventional mutational outcomes that occur for a particular group of functionally important, nonconserved positions; 5) epistasis (nonadditivity) among multiple mutations; and 6) the fact that a large fraction of a protein's amino acids contribute to its overall function. To overcome these challenges, new goals are identified for future studies.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, Kansas.
| |
Collapse
|
10
|
Zhang Q, Wang C, Wan M, Wu Y, Ma Q. Streptococcus pneumoniae Genome-wide Identification and Characterization of BOX Element-binding Domains. Mol Inform 2016; 34:742-52. [PMID: 27491035 DOI: 10.1002/minf.201500044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2015] [Indexed: 11/11/2022]
Abstract
The BOX elements are short repetitive DNA sequences that distribute randomly in intergenic regions of the Streptococcus pneumoniae genome. The function and origin of such elements are still unknown, but they were found to modulate expression of neighboring genes. Evidences suggested that the modulation's mechanism can be fulfilled by sequence-specific interaction of BOX elements with transcription factor family proteins. However, the type and function of these BOX-binding proteins still remain largely unexplored to date. In the current study we described a synthetic protocol to investigate the recognition and interaction between a highly conserved site of BOX elements and the DNA-binding domains of a variety of putative transcription factors in the pneumococcal genome. With the protocol we were able to predict those high-affinity domain binders of the conserved BOX DNA site (BOX DNA) in a high-throughput manner, and analyzed sequence-specific interaction in the domainDNA recognition at molecular level. Consequently, a number of putative transcription factor domains with both high affinity and specificity for the BOX DNA were identified, from which the helix-turn-helix (HTH) motif of a small heat shock factor was selected as a case study and tested for its binding capability toward the double-stranded BOX DNA using fluorescence anisotropy analysis. As might be expected, a relatively high affinity was detected for the interaction of HTH motif with BOX DNA with dissociation constant at nanomolar level. Molecular dynamics simulation, atomic structure examination and binding energy analysis revealed a complicated network of intensive nonbonded interactions across the complex interface, which confers both stability and specificity for the complex architecture.
Collapse
Affiliation(s)
- Qiao Zhang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Changzheng Wang
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Min Wan
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Yin Wu
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| | - Qianli Ma
- Institute of Respiratory Diseases, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, P.R. China
| |
Collapse
|
11
|
Sousa FL, Parente DJ, Hessman JA, Chazelle A, Teichmann SA, Swint-Kruse L. Data on publications, structural analyses, and queries used to build and utilize the AlloRep database. Data Brief 2016; 8:948-57. [PMID: 27508249 PMCID: PMC4961497 DOI: 10.1016/j.dib.2016.07.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Revised: 06/22/2016] [Accepted: 07/04/2016] [Indexed: 01/08/2023] Open
Abstract
The AlloRep database (www.AlloRep.org) (Sousa et al., 2016) [1] compiles extensive sequence, mutagenesis, and structural information for the LacI/GalR family of transcription regulators. Sequence alignments are presented for >3000 proteins in 45 paralog subfamilies and as a subsampled alignment of the whole family. Phenotypic and biochemical data on almost 6000 mutants have been compiled from an exhaustive search of the literature; citations for these data are included herein. These data include information about oligomerization state, stability, DNA binding and allosteric regulation. Protein structural data for 65 proteins are presented as easily-accessible, residue-contact networks. Finally, this article includes example queries to enable the use of the AlloRep database. See the related article, “AlloRep: a repository of sequence, structural and mutagenesis data for the LacI/GalR transcription regulators” (Sousa et al., 2016) [1].
Collapse
Affiliation(s)
- Filipa L Sousa
- Institute of Molecular Evolution, Heinrich-Heine Universität Düsseldorf, Universitätstrasse 1, 40225 Düsseldorf, Germany
| | - Daniel J Parente
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Jacob A Hessman
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Allen Chazelle
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | - Sarah A Teichmann
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK; Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Liskin Swint-Kruse
- The Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| |
Collapse
|
12
|
Parente DJ, Ray JCJ, Swint-Kruse L. Amino acid positions subject to multiple coevolutionary constraints can be robustly identified by their eigenvector network centrality scores. Proteins 2015; 83:2293-306. [PMID: 26503808 DOI: 10.1002/prot.24948] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Revised: 09/21/2015] [Accepted: 10/14/2015] [Indexed: 12/21/2022]
Abstract
As proteins evolve, amino acid positions key to protein structure or function are subject to mutational constraints. These positions can be detected by analyzing sequence families for amino acid conservation or for coevolution between pairs of positions. Coevolutionary scores are usually rank-ordered and thresholded to reveal the top pairwise scores, but they also can be treated as weighted networks. Here, we used network analyses to bypass a major complication of coevolution studies: For a given sequence alignment, alternative algorithms usually identify different, top pairwise scores. We reconciled results from five commonly-used, mathematically divergent algorithms (ELSC, McBASC, OMES, SCA, and ZNMI), using the LacI/GalR and 1,6-bisphosphate aldolase protein families as models. Calculations used unthresholded coevolution scores from which column-specific properties such as sequence entropy and random noise were subtracted; "central" positions were identified by calculating various network centrality scores. When compared among algorithms, network centrality methods, particularly eigenvector centrality, showed markedly better agreement than comparisons of the top pairwise scores. Positions with large centrality scores occurred at key structural locations and/or were functionally sensitive to mutations. Further, the top central positions often differed from those with top pairwise coevolution scores: instead of a few strong scores, central positions often had multiple, moderate scores. We conclude that eigenvector centrality calculations reveal a robust evolutionary pattern of constraints-detectable by divergent algorithms--that occur at key protein locations. Finally, we discuss the fact that multiple patterns coexist in evolutionary data that, together, give rise to emergent protein functions.
Collapse
Affiliation(s)
- Daniel J Parente
- Department of Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, Kansas, 66160
| | - J Christian J Ray
- Center for Computational Biology and Department of Molecular Biosciences, University of Kansas, Lawrence, Kansas, 66047
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, University of Kansas Medical Center, Kansas City, Kansas, 66160
| |
Collapse
|
13
|
Abstract
We review literature on the metabolism of ribo- and deoxyribonucleotides, nucleosides, and nucleobases in Escherichia coli and Salmonella,including biosynthesis, degradation, interconversion, and transport. Emphasis is placed on enzymology and regulation of the pathways, at both the level of gene expression and the control of enzyme activity. The paper begins with an overview of the reactions that form and break the N-glycosyl bond, which binds the nucleobase to the ribosyl moiety in nucleotides and nucleosides, and the enzymes involved in the interconversion of the different phosphorylated states of the nucleotides. Next, the de novo pathways for purine and pyrimidine nucleotide biosynthesis are discussed in detail.Finally, the conversion of nucleosides and nucleobases to nucleotides, i.e.,the salvage reactions, are described. The formation of deoxyribonucleotides is discussed, with emphasis on ribonucleotidereductase and pathways involved in fomation of dUMP. At the end, we discuss transport systems for nucleosides and nucleobases and also pathways for breakdown of the nucleobases.
Collapse
|
14
|
Parente DJ, Swint-Kruse L. Multiple co-evolutionary networks are supported by the common tertiary scaffold of the LacI/GalR proteins. PLoS One 2013; 8:e84398. [PMID: 24391951 PMCID: PMC3877293 DOI: 10.1371/journal.pone.0084398] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2013] [Accepted: 11/15/2013] [Indexed: 11/18/2022] Open
Abstract
Protein families might evolve paralogous functions on their common tertiary scaffold in two ways. First, the locations of functionally-important sites might be "hard-wired" into the structure, with novel functions evolved by altering the amino acid (e.g. Ala vs Ser) at these positions. Alternatively, the tertiary scaffold might be adaptable, accommodating a unique set of functionally important sites for each paralogous function. To discriminate between these possibilities, we compared the set of functionally important sites in the six largest paralogous subfamilies of the LacI/GalR transcription repressor family. LacI/GalR paralogs share a common tertiary structure, but have low sequence identity (≤ 30%), and regulate a variety of metabolic processes. Functionally important positions were identified by conservation and co-evolutionary sequence analyses. Results showed that conserved positions use a mixture of the "hard-wired" and "accommodating" scaffold frameworks, but that the co-evolution networks were highly dissimilar between any pair of subfamilies. Therefore, the tertiary structure can accommodate multiple networks of functionally important positions. This possibility should be included when designing and interpreting sequence analyses of other protein families. Software implementing conservation and co-evolution analyses is available at https://sourceforge.net/projects/coevolutils/.
Collapse
Affiliation(s)
- Daniel J. Parente
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
- * E-mail:
| |
Collapse
|
15
|
Meinhardt S, Manley MW, Parente DJ, Swint-Kruse L. Rheostats and toggle switches for modulating protein function. PLoS One 2013; 8:e83502. [PMID: 24386217 PMCID: PMC3875437 DOI: 10.1371/journal.pone.0083502] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2013] [Accepted: 11/03/2013] [Indexed: 01/08/2023] Open
Abstract
The millions of protein sequences generated by genomics are expected to transform protein engineering and personalized medicine. To achieve these goals, tools for predicting outcomes of amino acid changes must be improved. Currently, advances are hampered by insufficient experimental data about nonconserved amino acid positions. Since the property “nonconserved” is identified using a sequence alignment, we designed experiments to recapitulate that context: Mutagenesis and functional characterization was carried out in 15 LacI/GalR homologs (rows) at 12 nonconserved positions (columns). Multiple substitutions were made at each position, to reveal how various amino acids of a nonconserved column were tolerated in each protein row. Results showed that amino acid preferences of nonconserved positions were highly context-dependent, had few correlations with physico-chemical similarities, and were not predictable from their occurrence in natural LacI/GalR sequences. Further, unlike the “toggle switch” behaviors of conserved positions, substitutions at nonconserved positions could be rank-ordered to show a “rheostatic”, progressive effect on function that spanned several orders of magnitude. Comparisons to various sequence analyses suggested that conserved and strongly co-evolving positions act as functional toggles, whereas other important, nonconserved positions serve as rheostats for modifying protein function. Both the presence of rheostat positions and the sequence analysis strategy appear to be generalizable to other protein families and should be considered when engineering protein modifications or predicting the impact of protein polymorphisms.
Collapse
Affiliation(s)
- Sarah Meinhardt
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Michael W. Manley
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Daniel J. Parente
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, United States of America
- * E-mail:
| |
Collapse
|
16
|
Gatti-Lafranconi P, Dijkman WP, Devenish SRA, Hollfelder F. A single mutation in the core domain of the lac repressor reduces leakiness. Microb Cell Fact 2013; 12:67. [PMID: 23834731 PMCID: PMC3722110 DOI: 10.1186/1475-2859-12-67] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 06/29/2013] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND The lac operon provides cells with the ability to switch from glucose to lactose metabolism precisely when necessary. This metabolic switch is mediated by the lac repressor (LacI), which in the absence of lactose binds to the operator DNA sequence to inhibit transcription. Allosteric rearrangements triggered by binding of the lactose isomer allolactose to the core domain of the repressor impede DNA binding and lift repression. In Nature, the ability to detect and respond to environmental conditions comes at the cost of the encoded enzymes being constitutively expressed at low levels. The readily-switched regulation provided by LacI has resulted in its widespread use for protein overexpression, and its applications in molecular biology represent early examples of synthetic biology. However, the leakiness of LacI that is essential for the natural function of the lac operon leads to an increased energetic burden, and potentially toxicity, in heterologous protein production. RESULTS Analysis of the features that confer promiscuity to the inducer-binding site of LacI identified tryptophan 220 as a target for saturation mutagenesis. We found that phenylalanine (similarly to tryptophan) affords a functional repressor that is still responsive to IPTG. Characterisation of the W220F mutant, LacIWF, by measuring the time dependence of GFP production at different IPTG concentrations and at various incubation temperatures showed a 10-fold reduction in leakiness and no decrease in GFP production. Cells harbouring a cytotoxic protein under regulatory control of LacIWF showed no decrease in viability in the early phases of cell growth. Changes in responsiveness to IPTG observed in vivo are supported by the thermal shift assay behaviour of purified LacIWF with IPTG and operator DNA. CONCLUSIONS In LacI, long-range communications are responsible for the transmission of the signal from the inducer binding site to the DNA binding domain and our results are consistent with the involvement of position 220 in modulating these. The mutation of this single tryptophan residue to phenylalanine generated an enhanced repressor with a 10-fold decrease in leakiness. By minimising the energetic burden and cytotoxicity caused by leakiness, LacIWF constitutes a useful switch for protein overproduction and synthetic biology.
Collapse
Affiliation(s)
| | - Willem P Dijkman
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - Sean RA Devenish
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| |
Collapse
|
17
|
Meinhardt S, Manley MW, Becker NA, Hessman JA, Maher LJ, Swint-Kruse L. Novel insights from hybrid LacI/GalR proteins: family-wide functional attributes and biologically significant variation in transcription repression. Nucleic Acids Res 2012; 40:11139-54. [PMID: 22965134 PMCID: PMC3505978 DOI: 10.1093/nar/gks806] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
LacI/GalR transcription regulators have extensive, non-conserved interfaces between their regulatory domains and the 18 amino acids that serve as ‘linkers’ to their DNA-binding domains. These non-conserved interfaces might contribute to functional differences between paralogs. Previously, two chimeras created by domain recombination displayed novel functional properties. Here, we present a synthetic protein family, which was created by joining the LacI DNA-binding domain/linker to seven additional regulatory domains. Despite ‘mismatched’ interfaces, chimeras maintained allosteric response to their cognate effectors. Therefore, allostery in many LacI/GalR proteins does not require interfaces with precisely matched interactions. Nevertheless, the chimeric interfaces were not silent to mutagenesis, and preliminary comparisons suggest that the chimeras provide an ideal context for systematically exploring functional contributions of non-conserved positions. DNA looping experiments revealed higher order (dimer–dimer) oligomerization in several chimeras, which might be possible for the natural paralogs. Finally, the biological significance of repression differences was determined by measuring bacterial growth rates on lactose minimal media. Unexpectedly, moderate and strong repressors showed an apparent induction phase, even though inducers were not provided; therefore, an unknown mechanism might contribute to regulation of the lac operon. Nevertheless, altered growth correlated with altered repression, which indicates that observed functional modifications are significant.
Collapse
Affiliation(s)
- Sarah Meinhardt
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA
| | | | | | | | | | | |
Collapse
|
18
|
Kumaraswami M, Avanigadda L, Rai R, Park HW, Howe MM. Genetic analysis of phage Mu Mor protein amino acids involved in DNA minor groove binding and conformational changes. J Biol Chem 2011; 286:35852-35862. [PMID: 21859715 DOI: 10.1074/jbc.m111.269860] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Gene expression during lytic development of bacteriophage Mu occurs in three phases: early, middle, and late. Transcription from the middle promoter, P(m), requires the phage-encoded activator protein Mor and the bacterial RNA polymerase. The middle promoter has a -10 hexamer, but no -35 hexamer. Instead P(m) has a hyphenated inverted repeat that serves as the Mor binding site overlapping the position of the missing -35 element. Mor binds to this site as a dimer and activates transcription by recruiting RNA polymerase. The crystal structure of the His-Mor dimer revealed three structural elements: an N-terminal dimerization domain, a C-terminal helix-turn-helix DNA-binding domain, and a β-strand linker between the two domains. We predicted that the highly conserved residues in and flanking the β-strand would be essential for the conformational flexibility and DNA minor groove binding by Mor. To test this hypothesis, we carried out single codon-specific mutagenesis with degenerate oligonucleotides. The amino acid substitutions were identified by DNA sequencing. The mutant proteins were characterized for their overexpression, solubility, DNA binding, and transcription activation. This analysis revealed that the Gly-Gly motif formed by Gly-65 and Gly-66 and the β-strand side chain of Tyr-70 are crucial for DNA binding by His-tagged Mor. Mutant proteins with substitutions at Gly-74 retained partial activity. Treatment with the minor groove- and GC-specific chemical chromomycin A(3) demonstrated that chromomycin prevented His-Mor binding but could not disrupt a pre-formed His-Mor·DNA complex, consistent with the prediction that Mor interacts with the minor groove of the GC-rich spacer in the Mor binding site.
Collapse
Affiliation(s)
- Muthiah Kumaraswami
- Department of Microbiology, Immunology, and Biochemistry, University of Tennessee Health Science Center, Memphis, Tennessee 38163
| | - Lakshmi Avanigadda
- Department of Microbiology, Immunology, and Biochemistry, University of Tennessee Health Science Center, Memphis, Tennessee 38163
| | - Rajendra Rai
- Department of Microbiology, Immunology, and Biochemistry, University of Tennessee Health Science Center, Memphis, Tennessee 38163
| | - Hee-Won Park
- Department of Pharmacology, Structural Genomics Consortium, University of Toronto, Toronto, Ontario M5G1L7 Canada
| | - Martha M Howe
- Department of Microbiology, Immunology, and Biochemistry, University of Tennessee Health Science Center, Memphis, Tennessee 38163.
| |
Collapse
|
19
|
Tungtur S, Parente DJ, Swint-Kruse L. Functionally important positions can comprise the majority of a protein's architecture. Proteins 2011; 79:1589-608. [PMID: 21374721 DOI: 10.1002/prot.22985] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Revised: 12/08/2010] [Accepted: 12/15/2010] [Indexed: 01/13/2023]
Abstract
Concomitant with the genomic era, many bioinformatics programs have been developed to identify functionally important positions from sequence alignments of protein families. To evaluate these analyses, many have used the LacI/GalR family and determined whether positions predicted to be "important" are validated by published experiments. However, we previously noted that predictions do not identify all of the experimentally important positions present in the linker regions of these homologs. In an attempt to reconcile these differences, we corrected and expanded the LacI/GalR sequence set commonly used in sequence/function analyses. Next, a variety of analyses were carried out (1) for the entire LacI/GalR sequence set and (2) for a subset of homologs with functionally-important "YxPxxxAxxL" motifs in their linkers. This strategy was devised to determine whether predictions could be improved by knowledge-based sequence sorting and-for some analyses-did increase the number of linker positions identified. However, two functionally important linker positions were not reliably identified by any analysis. Finally, we compared the new predictions to all known experimental data for E. coli LacI and three homologous linkers. From these, we estimate that >50% of positions are important to the functions of the LacI/GalR homologs. In corollary, neutral positions might occur less frequently and might be easier to detect in sequence analyses. Although analyses have successfully guided mutations that partially exchange protein functions, a better experimental understanding of the sequence/function relationships in protein families would be helpful for uncovering the remaining rules used by nature to evolve new protein functions.
Collapse
Affiliation(s)
- Sudheer Tungtur
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, MSN 3030, Kansas City, Kansas 66160, USA
| | | | | |
Collapse
|
20
|
Comparing the functional roles of nonconserved sequence positions in homologous transcription repressors: implications for sequence/function analyses. J Mol Biol 2009; 395:785-802. [PMID: 19818797 DOI: 10.1016/j.jmb.2009.10.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2009] [Revised: 10/01/2009] [Accepted: 10/02/2009] [Indexed: 11/21/2022]
Abstract
The explosion of protein sequences deduced from genetic code has led to both a problem and a potential resource: Efficient data use requires interpreting the functional impact of sequence change without experimentally characterizing each protein variant. Several groups have hypothesized that interpretation could be aided by analyzing the sequences of naturally occurring homologues. To that end, myriad sequence/function analyses have been developed to predict which conserved, semi-conserved, and nonconserved positions are functionally important. These positions must be discriminated from the nonconserved positions that are functionally silent. However, the assumptions that underlie sequence analyses are based on experimental results that are sparse and usually designed to address different questions. Here, we use three homologues from a test family common to bioinformatics-the LacI/GalR transcription repressors-to test a common assumption: If a position is functionally important for one family member, it has similar importance in all homologues. We generated experimental sequence/function information for each nonconserved position in the 18 amino acids that link the DNA-binding and regulatory domains of three LacI/GalR homologues. We find that the functional importance of each position is preserved among the three linkers, albeit to different degrees. We also find that every linker position contributes to function, which has twofold implications. (1) Since the linker positions range from highly conserved to semi-conserved to nonconserved and contribute to affinity, selectivity, and allosteric response, we assert that sequence/function analyses must identify positions in the LacI/GalR linkers to be qualified as "successful". Many analyses overlook this region since most of the residues do not directly contact ligand. (2) No position in the LacI/GalR linker is functionally silent. This finding is inconsistent with another underlying principle of many analyses: Using sequence sets to discriminate important from non-contributing positions obligates silent positions, which denotes that most homologues tolerate a variety of amino acid substitutions at the position without functional change. Instead, additional combinatorial mutants in the LacI/GalR linkers show that particular substitutions can be silent in a context-dependent manner. Thus, specific permutations of sequence change (rather than change at silent positions) would facilitate neutral drift during evolution. Finally, the combinatorial mutants also reveal functional synergy between semi- and nonconserved positions. Such functional relationships would be missed by analyses that rely primarily upon co-evolution.
Collapse
|
21
|
Swint-Kruse L, Matthews KS. Allostery in the LacI/GalR family: variations on a theme. Curr Opin Microbiol 2009; 12:129-37. [PMID: 19269243 DOI: 10.1016/j.mib.2009.01.009] [Citation(s) in RCA: 115] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Revised: 01/22/2009] [Accepted: 01/26/2009] [Indexed: 12/21/2022]
Abstract
The lactose repressor protein (LacI) was among the very first genetic regulatory proteins discovered, and more than 1000 members of the bacterial LacI/GalR family are now identified. LacI has been the prototype for understanding how transcription is controlled using small metabolites to modulate protein association with specific DNA sites. This understanding has been greatly expanded by the study of other LacI/GalR homologues. A general picture emerges in which the conserved fold provides a scaffold for multiple types of interactions - including oligomerization, small molecule binding, and protein-protein binding - that in turn influence target DNA binding and thereby regulate mRNA production. Although many different functions have evolved from this basic scaffold, each homologue retains functional flexibility: For the same protein, different small molecules can have disparate impact on DNA binding and hence transcriptional outcome. In turn, binding to alternative DNA sequences may impact the degree of allosteric response. Thus, this family exhibits a symphony of variations by which transcriptional control is achieved.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, 66160, United States.
| | | |
Collapse
|
22
|
Meinhardt S, Swint-Kruse L. Experimental identification of specificity determinants in the domain linker of a LacI/GalR protein: bioinformatics-based predictions generate true positives and false negatives. Proteins 2008; 73:941-57. [PMID: 18536016 DOI: 10.1002/prot.22121] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
In protein families, conserved residues often contribute to a common general function, such as DNA-binding. However, unique attributes for each homolog (e.g. recognition of alternative DNA sequences) must arise from variation in other functionally-important positions. The locations of these "specificity determinant" positions are obscured amongst the background of varied residues that do not make significant contributions to either structure or function. To isolate specificity determinants, a number of bioinformatics algorithms have been developed. When applied to the LacI/GalR family of transcription regulators, several specificity determinants are predicted in the 18 amino acids that link the DNA-binding and regulatory domains. However, results from alternative algorithms are only in partial agreement with each other. Here, we experimentally evaluate these predictions using an engineered repressor comprising the LacI DNA-binding domain, the LacI linker, and the GalR regulatory domain (LLhG). "Wild-type" LLhG has altered DNA specificity and weaker lacO(1) repression compared to LacI or a similar LacI:PurR chimera. Next, predictions of linker specificity determinants were tested, using amino acid substitution and in vivo repression assays to assess functional change. In LLhG, all predicted sites are specificity determinants, as well as three sites not predicted by any algorithm. Strategies are suggested for diminishing the number of false negative predictions. Finally, individual substitutions at LLhG specificity determinants exhibited a broad range of functional changes that are not predicted by bioinformatics algorithms. Results suggest that some variants have altered affinity for DNA, some have altered allosteric response, and some appear to have changed specificity for alternative DNA ligands.
Collapse
Affiliation(s)
- Sarah Meinhardt
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas 66160, USA
| | | |
Collapse
|
23
|
Zhan H, Taraban M, Trewhella J, Swint-Kruse L. Subdividing repressor function: DNA binding affinity, selectivity, and allostery can be altered by amino acid substitution of nonconserved residues in a LacI/GalR homologue. Biochemistry 2008; 47:8058-69. [PMID: 18616293 DOI: 10.1021/bi800443k] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
Many mutations that impact protein function occur at residues that do not directly contact ligand. To understand the functional contributions from the sequence that links the DNA-binding and regulatory domains of the LacI/GalR homologues, we have created a chimeric protein (LLhP), which comprises the LacI DNA-binding domain, the LacI linker, and the PurR regulatory domain. Although DNA binding site residues are identical in LLhP and LacI, thermodynamic measurements of DNA binding affinity show that LLhP does not discriminate between alternative DNA ligands as well as LacI. In addition, small-angle scattering experiments show that LLhP is more compact than LacI. When DNA is released, LacI shows a 20 A increase in length that was previously attributed to unfolding of the linker. This change is not seen in apo-LLhP, even though the linker sequences of the two proteins are identical. Together, results indicate that long-range functional and structural changes are propagated across the interface that forms between the linker and regulatory domain. These changes could be mediated via the side chains of several linker residues that contact the regulatory domains of the naturally occurring proteins, LacI and PurR. Substitution of these residues in LLhP leads to a range of functional effects. Four variants exhibit altered affinity for DNA, with no changes in selectivity or allosteric response. Another two result in proteins that bind operator DNA with very low affinity and no allosteric response, similar to LacI binding nonspecific DNA sequences. Two more substitutions simultaneously diminish affinity, enhance allostery, and profoundly alter DNA ligand selectivity. Thus, positions within the linker can be varied to modulate different aspects of repressor function.
Collapse
Affiliation(s)
- Hongli Zhan
- Department of Biochemistry and Molecular Biology, MSN 3030, 3901 Rainbow Boulevard, The University of Kansas Medical Center, Kansas City, Kansas 66160, USA
| | | | | | | |
Collapse
|
24
|
|
25
|
Tungtur S, Egan SM, Swint-Kruse L. Functional consequences of exchanging domains between LacI and PurR are mediated by the intervening linker sequence. Proteins 2007; 68:375-88. [PMID: 17436321 PMCID: PMC2084478 DOI: 10.1002/prot.21412] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Homologue function can be differentiated by changing residues that affect binding sites or long-range interactions. LacI and PurR are two proteins that represent the LacI/GalR family (>500 members) of bacterial transcription regulators. All members have distinct DNA-binding and regulatory domains linked by approximately 18 amino acids. Each homologue has specificity for different DNA and regulatory effector ligands; LacI and PurR also exhibit differences in allosteric communication between DNA and effector binding sites. A comparative study of LacI and PurR suggested that alterations in the interface between the regulatory domain and linker are important for differentiating their functions. Four residues (equivalent to LacI positions 48, 55, 58, and 61) appear particularly important for creating a unique interface and were predicted to be necessary for allosteric regulation. However, nearby residues in the linker interact with DNA ligand. Thus, differences observed in interactions between linker and regulatory domain may be the cause of altered function or an effect of the two proteins binding different DNA ligands. To separate these possibilities, we created a chimeric protein with the LacI DNA-binding domain/linker and the PurR regulatory domain (LLhP). If the interface requires homologue-specific interactions in order to propagate the signal from effector binding, then LLhP repression should not be allosterically regulated by effector binding. Experiments show that LLhP is capable of repression from lacO1 and, contrary to expectation, allosteric response is intact. Further, restoring the potential for PurR-like interactions via substitutions in the LLhP linker tends to diminish repression. These effects are especially pronounced for residues 58 and 61. Clearly, binding affinity of LLhP for the lacO1 DNA site is sensitive to long-range changes in the linker. This result also raises the possibility that mutations at positions 58 and 61 co-evolved with changes in the DNA-binding site. In addition, repression measured in the absence and presence of effector ligand shows that allosteric response increases for several LLhP variants with substitutions at positions 48 and 55. Thus, while side chain variation at these sites does not generally dictate the presence or absence of allostery, the nature of the amino acid can modulate the response to effector.
Collapse
Affiliation(s)
- Sudheer Tungtur
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas 66160
| | - Susan M. Egan
- Department of Molecular Biosciences, The University of Kansas–Lawrence, Lawrence, Kansas 66045
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas 66160
- *Correspondence to: Liskin Swint-Kruse, Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas 66160. E-mail:
| |
Collapse
|
26
|
Chiu HC, Chang CA, Hu YJ. Prediction of orthologous relationship by functionally important sites. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2005; 78:209-22. [PMID: 15899306 DOI: 10.1016/j.cmpb.2005.03.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/19/2004] [Revised: 03/01/2005] [Accepted: 03/03/2005] [Indexed: 05/02/2023]
Abstract
Making accurate functional predictions plays an important role in the era of proteomics. Reliable functional information can be extracted from orthologs in other species when annotating an unknown gene. Here a site-based approach called PORFIS is proposed to predict orthologous relationship. When applied to the bacterial transcription factor PurR/LacI family and the protein kinase AGC family, our method was able to identify, with few false positives, the important sites that agree with those verified by biological experiments. We also tested it on the alpha-proteasome family, the glycoprotein hormone family and the growth hormone family to demonstrate its ability to predict orthologous relationship. Compared with other prediction methods based on phylogenetic analysis or hidden Markov models, PORFIS not only has competitive prediction accuracy, but also provides valuable biological information of functionally important sites associated with orthologs which can be further studied in biological experiments.
Collapse
Affiliation(s)
- Hsuan-Chao Chiu
- Department of Computer and Information Science, National Chiao Tung University, 1001 Ta Shueh Rd., Hsinchu, Taiwan
| | | | | |
Collapse
|
27
|
Devroede N, Thia-Toong TL, Gigot D, Maes D, Charlier D. Purine and pyrimidine-specific repression of the Escherichia coli carAB operon are functionally and structurally coupled. J Mol Biol 2004; 336:25-42. [PMID: 14741201 DOI: 10.1016/j.jmb.2003.12.024] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Transcription of the carAB operon encoding the sole carbamoylphosphate synthetase of Escherichia coli proceeds from a tandem pair of promoters. P2, downstream, is repressed by arginine and the ArgR protein, whereas P1 is submitted to pyrimidine-specific regulation and as shown here to purine-specific control exerted by binding of the PurR protein to a PUR box sequence centered around nucleotide -128.5 with respect to the start of P1 transcription. In vivo analyses of the effects of trans and cis-acting mutations on the regulatory responses and single round in vitro transcription assays indicated that ligand-bound PurR is by itself unable to inhibit P1 promoter activity. To exert its effect PurR relies on the elaborated nucleoprotein complex that governs P1 activity in a pyrimidine-specific manner. Thus we reveal the existence of an unprecedented functional and structural coupling between the modulation of P1 activity by purine and pyrimidine residues that appears to result from the unique position of the PUR box in the carAB control region, far upstream of the promoter. Missing contact and premethylation binding interference studies revealed the importance of base-specific groups and of structural aspects of the PUR box sequence in complex formation. Permutation assays indicated that the overall PurR-induced bending of the carAB control region is slightly less pronounced than that of the purF operator. The PUR boxes of the carAB operon of E.coli and Salmonella typhimurium are unique in that they have a guanine residue at position eight. Interestingly, guanine at this position has been proposed to be extremely unfavorable on the basis of modeling and binding studies, as its exocyclic amino group would enter into a steric clash with the side-chain of lysine 55. To analyze the effect of guanine at position eight in the upstream half-site of the carAB operator we constructed the adenine derivative and assayed in vivo repressibility of P1 promoter activity and in vitroPurR binding to the mutant operator, and constructed a molecular model for the unusual lysine 55-guanine 8 interaction.
Collapse
Affiliation(s)
- Neel Devroede
- Erfelijkheidsleer en Microbiologie, Vrije Universiteit Brussel (VUB), Pleinlaan 2, B-1050 Brussels, Belgium
| | | | | | | | | |
Collapse
|
28
|
Lamoureux JS, Maynes JT, Glover JNM. Recognition of 5'-YpG-3' sequences by coupled stacking/hydrogen bonding interactions with amino acid residues. J Mol Biol 2004; 335:399-408. [PMID: 14672650 DOI: 10.1016/j.jmb.2003.10.071] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The combined biochemical and structural study of hundreds of protein-DNA complexes has indicated that sequence-specific interactions are mediated by two mechanisms termed direct and indirect readout. Direct readout involves direct interactions between the protein and base-specific atoms exposed in the major and minor grooves of DNA. For indirect readout, the protein recognizes DNA by sensing conformational variations in the structure dependent on nucleotide sequence, typically through interactions with the phosphodiester backbone. Based on our recent structure of Ndt80 bound to DNA in conjunction with a search of the existing PDB database, we propose a new method of sequence-specific recognition that utilizes both direct and indirect readout. In this mode, a single amino acid side-chain recognizes two consecutive base-pairs. The 3'-base is recognized by canonical direct readout, while the 5'-base is recognized through a variation of indirect readout, whereby the conformational flexibility of the particular dinucleotide step, namely a 5'-pyrimidine-purine-3' step, facilitates its recognition by the amino acid via cation-pi interactions. In most cases, this mode of DNA recognition helps explain the sequence specificity of the protein for its target DNA.
Collapse
Affiliation(s)
- Jason S Lamoureux
- Department of Biochemistry, University of Alberta, Edmonton, Alta., Canada T6G 2H7
| | | | | |
Collapse
|
29
|
Lin T, Cavarelli J, Johnson JE. Evidence for assembly-dependent folding of protein and RNA in an icosahedral virus. Virology 2003; 314:26-33. [PMID: 14517057 DOI: 10.1016/s0042-6822(03)00457-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Ordered nucleic acid in an icosahedral virus was first visualized in the X-ray structure of the Picorna-like plant virus, Bean pod mottle virus (BPMV). Virus particles containing the 3500 nucleotide segment of the BPMV bipartite RNA genome (middle component) had nearly 20% of the genome ordered. Here we report the refined structures of the middle component, bottom component (particles containing the 5800 nucleotide segment of the genome), and top component (empty particles of BPMV capsid protein). The bottom component particles contain ordered RNA in the same location as middle component. Although the ordered RNA density in both nucleoprotein particles is the average of the contents of 60 icosahedral asymmetric units, both nucleoprotein components show that the base density for the first two nucleotides is predominantly purine, while the next five appear to be predominantly pyrimidine. The empty capsid demonstrates that RNA dictates the order of the N-terminal 19 residues of the large subunit because these residues are invisible in the top component.
Collapse
Affiliation(s)
- Tianwei Lin
- Department of Molecular Biology and Center for Integrative Molecular Biosciences, The Scripps Research Institute, 10550 N. Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | |
Collapse
|
30
|
Mirny LA, Gelfand MS. Using orthologous and paralogous proteins to identify specificity-determining residues in bacterial transcription factors. J Mol Biol 2002; 321:7-20. [PMID: 12139929 DOI: 10.1016/s0022-2836(02)00587-9] [Citation(s) in RCA: 117] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Concepts of orthology and paralogy are become increasingly important as whole-genome comparison allows their identification in complete genomes. Functional specificity of proteins is assumed to be conserved among orthologs and is different among paralogs. We used this assumption to identify residues which determine specificity of protein-DNA and protein-ligand recognition. Finding such residues is crucial for understanding mechanisms of molecular recognition and for rational protein and drug design. Assuming conservation of specificity among orthologs and different specificity of paralogs, we identify residues that correlate with this grouping by specificity. The method is taking advantage of complete genomes to find multiple orthologs and paralogs. The central part of this method is a procedure to compute statistical significance of the predictions. The procedure is based on a simple statistical model of protein evolution. When applied to a large family of bacterial transcription factors, our method identified 12 residues that are presumed to determine the protein-DNA and protein-ligand recognition specificity. Structural analysis of the proteins and available experimental results strongly support our predictions. Our results suggest new experiments aimed at rational re-design of specificity in bacterial transcription factors by a minimal number of mutations.
Collapse
Affiliation(s)
- Leonid A Mirny
- Harvard-MIT Division of Health Science and Technology, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge 02139, USA.
| | | |
Collapse
|
31
|
Swint-Kruse L, Larson C, Pettitt BM, Matthews KS. Fine-tuning function: correlation of hinge domain interactions with functional distinctions between LacI and PurR. Protein Sci 2002; 11:778-94. [PMID: 11910022 PMCID: PMC2373529 DOI: 10.1110/ps.4050102] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
LacI and PurR are highly homologous proteins. Their functional units are homodimers, with an N-terminal DNA binding domain that comprises the helix-turn-helix (HTH), N-linker, and hinge regions from both monomers. Hinge structural changes are known to occur upon DNA dissociation but are difficult to monitor experimentally. The initial steps of hinge unfolding were therefore examined using molecular dynamics simulations, utilizing a truncated, chimeric protein comprising the LacI HTH/N-linker and PurR hinge. A terminal Gly-Cys-Gly was added to allow "dimerization" through disulfide bond formation. Simulations indicate that differences in LacI and PurR hinge primary sequence affect the quaternary structure of the hinge x hinge' interface. However, these alternate hinge orientations would be sterically restricted by the core domain. These results prompted detailed comparison of recently available DNA-bound structures for LacI and truncated LacI(1-62) with the PurR structure. Examination revealed that different N-linker and hinge contacts to the core domain of the partner monomer (which binds effector molecule) affect the juxtapositions of the HTH, N-linker, and hinge regions in the DNA binding domain. In addition, the two full-length repressors exhibit significant differences in the interactions between the core and the C-linker connection to the DNA binding domain. Both linkers and the hinge have been implicated in the allosteric response of these repressors. Intriguingly, one functional difference between these two proteins is that they exhibit opposite allosteric response to effector. Simulations and observed structural distinctions are correlated with mutational analysis and sequence information from the LacI/GalR family to formulate a mechanism for fine-tuning individual repressor function.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas 77005, USA.
| | | | | | | |
Collapse
|
32
|
Dunten P, Belunis C, Crowther R, Hollfelder K, Kammlott U, Levin W, Michel H, Ramsey GB, Swain A, Weber D, Wertheimer SJ. Crystal structure of human cytosolic phosphoenolpyruvate carboxykinase reveals a new GTP-binding site. J Mol Biol 2002; 316:257-64. [PMID: 11851336 DOI: 10.1006/jmbi.2001.5364] [Citation(s) in RCA: 63] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We report crystal structures of the human enzyme phosphoenolpyruvate carboxykinase (PEPCK) with and without bound substrates. These structures are the first to be determined for a GTP-dependent PEPCK, and provide the first view of a novel GTP-binding site unique to the GTP-dependent PEPCK family. Three phenylalanine residues form the walls of the guanine-binding pocket on the enzyme's surface and, most surprisingly, one of the phenylalanine side-chains contributes to the enzyme's specificity for GTP. PEPCK catalyzes the rate-limiting step in the metabolic pathway that produces glucose from lactate and other precursors derived from the citric acid cycle. Because the gluconeogenic pathway contributes to the fasting hyperglycemia of type II diabetes, inhibitors of PEPCK may be useful in the treatment of diabetes.
Collapse
Affiliation(s)
- Pete Dunten
- Roche Research Center, Hoffmann-La Roche Inc., Nutley, NJ 07110, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
33
|
Moraitis MI, Xu H, Matthews KS. Ion concentration and temperature dependence of DNA binding: comparison of PurR and LacI repressor proteins. Biochemistry 2001; 40:8109-17. [PMID: 11434780 DOI: 10.1021/bi0028643] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Purine repressor (PurR) binding to specific DNA is enhanced by complexing with purines, whereas lactose repressor (LacI) binding is diminished by interaction with inducer sugars despite 30% identity in their protein sequences and highly homologous tertiary structures. Nonetheless, in switching from low- to high-affinity DNA binding, these proteins undergo a similar structural change in which the hinge region connecting the DNA and effector binding domains folds into an alpha-helix and contacts the DNA minor groove. The differences in response to effector for these proteins should be manifest in the polyelectrolyte effect which arises from cations displaced from DNA by interaction with positively charged side chains on a protein and is quantitated by measurement of DNA binding affinity as a function of ion concentration. Consistent with structural data for these proteins, high-affinity operator DNA binding by the PurR-purine complex involved approximately 15 ion pairs, a value significantly greater than that for the corresponding state of LacI (approximately 6 ion pairs). For both proteins, however, conversion to the low-affinity state results in a decrease of approximately 2-fold in the number of cations released per dimeric DNA binding site. Heat capacity changes (DeltaC(p)) that accompany DNA binding, derived from buried apolar surface area, coupled folding, and restriction of motional freedom of polar groups in the interface, also reflect the differences between these homologous repressor proteins. DNA binding of the PurR-guanine complex is accompanied by a DeltaC(p) (-2.8 kcal mol(-1) K(-1)) more negative than that observed previously for LacI (-0.9 to -1.5 kcal mol(-1) K(-1)), suggesting that more extensive protein folding and/or enhanced structural rigidity may occur upon DNA binding for PurR compared to DNA binding for LacI. The differences between these proteins illustrate plasticity of function despite high-level sequence and structural homology and undermine efforts to predict protein behavior on the basis of such similarities.
Collapse
Affiliation(s)
- M I Moraitis
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas 77005, USA
| | | | | |
Collapse
|
34
|
Pabo CO, Nekludova L. Geometric analysis and comparison of protein-DNA interfaces: why is there no simple code for recognition? J Mol Biol 2000; 301:597-624. [PMID: 10966773 DOI: 10.1006/jmbi.2000.3918] [Citation(s) in RCA: 198] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Structural studies of protein-DNA complexes have shown that there are many distinct families of DNA-binding proteins, and have shown that there is no simple "code" describing side-chain/base interactions. However, systematic analysis and comparison of protein-DNA complexes has been complicated by the diversity of observed contacts, the sheer number of complexes currently available and the absence of any consistent method of comparison that retains detailed structural information about the protein-DNA interface. To address these problems, we have developed geometric methods for characterizing the local structural environment in which particular side-chain/base interactions are observed. In particular, we develop methods for analyzing and comparing spatial relationships at the protein-DNA interface. Our method involves attaching local coordinate systems to the DNA bases and to the C(alpha) atoms of the peptide backbone (these are relatively rigid structural units). We use these tools to consider how the position and orientation of the polypeptide backbone (with respect to the DNA) helps to determine what contacts are possible at any given position in a protein-DNA complex. Here, we focus on base contacts that are made in the major groove, and we use spatial relationships in analyzing: (i) the observed patterns of side-chain/base interactions; (ii) observed helix docking orientations; (iii) family/subfamily relationships among DNA-binding proteins; and (iv) broader questions about evolution, altered specificity mutants and the limits for the design of new DNA-binding proteins. Our analysis, which highlights differences in spatial relationships in different complexes and at different positions in a complex, helps explain why there is no simple, general code for protein-DNA recognition.
Collapse
Affiliation(s)
- C O Pabo
- Howard Hughes Medical Institute, Department of Biology 68-580, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. pabo@,it.edu
| | | |
Collapse
|
35
|
Pérez-Rueda E, Collado-Vides J. The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12. Nucleic Acids Res 2000; 28:1838-47. [PMID: 10734204 PMCID: PMC102813 DOI: 10.1093/nar/28.8.1838] [Citation(s) in RCA: 218] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Using a combination of several approaches we estimated and characterized a total of 314 regulatory DNA-binding proteins in Escherichia coli, which might represent its minimal set of transcription factors. The collection is comprised of 35% activators, 43% repressors and 22% dual regulators. Within many regulatory protein families, the members are homogeneous in their regulatory roles, physiology of regulated genes, regulatory function, length and genome position, showing that these families have evolved homogeneously in prokaryotes, particularly in E.coli. This work describes a full characterization of the repertoire of regulatory interactions in a whole living cell. This repertoire should contribute to the interpretation of global gene expression profiles in both prokaryotes and eukaryotes.
Collapse
Affiliation(s)
- E Pérez-Rueda
- Programa de Biología Molecular Computacional, Centro de Investigación sobre Fijación de Nitrógeno, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, AP 565-A 62110, Mexico
| | | |
Collapse
|
36
|
Johnson JM, Church GM. Predicting ligand-binding function in families of bacterial receptors. Proc Natl Acad Sci U S A 2000; 97:3965-70. [PMID: 10737762 PMCID: PMC18125 DOI: 10.1073/pnas.050580897] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The three-dimensional fold of a new protein sequence can often be inferred directly from sequence homology to a protein of known structure. The function of a new protein sequence is more difficult to predict, however, since homologues can have different molecular and cellular functions. To develop and automate computational methods for determining molecular function, we have analyzed ligand-binding specificity in two related families of binding proteins. One of these families includes Escherichia coli lactose repressor and ribose-binding protein, and the other includes E. coli sulfate- and phosphate-binding proteins. These proteins have similar folds but varying specificity, binding many different small molecules, including mono- and disaccharides, purines, oxyanions, ferric iron, and polyamines. Starting from template structural alignments, alignments of over 90 sequences per family were generated by iterative database searches with hidden Markov models. Phylogenetic trees were made of full-length sequences and of subsets of residues lining the binding cleft, to determine whether subbranches of the trees correlate with ligand-binding preference. Automated analyses of residues in the binding pocket were also used to predict ligand-binding function for many uncharacterized database sequences and to identify specific side chain-ligand contacts in proteins without solved structures. Our results demonstrate the utility of anchoring functional annotation within a protein family context.
Collapse
Affiliation(s)
- J M Johnson
- Graduate Program in Biophysics and Department of Genetics, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| | | |
Collapse
|
37
|
Erlandsen H, Bjørgo E, Flatmark T, Stevens RC. Crystal structure and site-specific mutagenesis of pterin-bound human phenylalanine hydroxylase. Biochemistry 2000; 39:2208-17. [PMID: 10694386 DOI: 10.1021/bi992531+] [Citation(s) in RCA: 86] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The crystal structure of the dimeric catalytic domain (residues 118-424) of human PheOH (hPheOH), cocrystallized with the oxidized form of the cofactor (7,8-dihydro-L-biopterin, BH(2)), has been determined at 2.0 A resolution. The pterin binds in the second coordination sphere of the catalytic iron (the C4a atom is 6.1 A away), and interacts through several hydrogen bonds to two water molecules coordinated to the iron, as well as to the main chain carbonyl oxygens of Ala322, Gly247, and Leu249 and the main chain amide of Leu249. Some important conformational changes are seen in the active site upon pterin binding. The loop between residues 245 and 250 moves in the direction of the iron, and thus allows for several important hydrogen bonds to the pterin ring to be formed. The pterin cofactor is in an ideal orientation for dioxygen to bind in a bridging position between the iron and the pterin. The pterin ring forms an aromatic pi-stacking interaction with Phe254, and Tyr325 contributes to the positioning of the pterin ring and its dihydroxypropyl side chain by hydrophobic interactions. Of particular interest in the hPheOH x BH(2) binary complex structure is the finding that Glu286 hydrogen bonds to one of the water molecules coordinated to the iron as well as to a water molecule which hydrogen bonds to N3 of the pterin ring. Site-specific mutations of Glu286 (E286A and E286Q), Phe254 (F254A and F254L), and Tyr325 (Y325F) have confirmed the important contribution of Glu286 and Phe254 to the normal positioning of the pterin cofactor and catalytic activity of hPheOH. Tyr325 also contributes to the correct positioning of the pterin, but has no direct function in the catalytic reaction, in agreement with the results obtained with rat TyrOH [Daubner, S. C., and Fitzpatrick, P. F. (1998) Biochemistry 37, 16440-16444]. Superposition of the binary hPheOH.BH(2) complex onto the crystal structure of the ligand-free rat PheOH (which contains the regulatory and catalytic domains) [Kobe, B., Jennings, I. G., House, C. M., Michell, B. J., Goodwill, K. E., Santarsiero, B. D., Stevens, R. C., Cotton, R. G. H., and Kemp, B. E. (1999) Nat. Struct. Biol. 6, 442-448] reveals that the C2'-hydroxyl group of BH(2) is sufficiently close to form hydrogen bonds to Ser23 in the regulatory domain. Similar interactions are seen with the hPheOH.adrenaline complex and Ser23. These interactions suggest a structural explanation for the specific regulatory properties of the dihydroxypropyl side chain of BH(4) (negative effector) in the full-length enzyme in terms of phosphorylation of Ser16 and activation by L-Phe.
Collapse
Affiliation(s)
- H Erlandsen
- Departments of Molecular Biology and Chemistry, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, California 92037, USA
| | | | | | | |
Collapse
|
38
|
Abstract
On the basis of a structural analysis of 240 protein-DNA complexes contained in the Protein Data Bank (PDB), we have classified the DNA-binding proteins involved into eight different structural/functional groups, which are further classified into 54 structural families. Here we present this classification and review the functions, structures and binding interactions of these protein-DNA complexes.
Collapse
Affiliation(s)
- N M Luscombe
- Biomolecular Structure and Modelling Unit, Department of Biochemistry and Molecular Biology, University College, Gower Street, London WC1E 6BT, UK.
| | | | | | | |
Collapse
|
39
|
Glasfeld A, Koehler AN, Schumacher MA, Brennan RG. The role of lysine 55 in determining the specificity of the purine repressor for its operators through minor groove interactions. J Mol Biol 1999; 291:347-61. [PMID: 10438625 DOI: 10.1006/jmbi.1999.2946] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The interaction of the dimeric Escherichia coli purine repressor (PurR) with its cognate sequences leads to a 45 degrees to 50 degrees kink at a central CpG base step towards the major groove, as dyad-related leucine side-chains interdigitate between these bases from the minor groove. The resulting broadening of the minor groove increases the accessibility of the six central base-pairs towards minor groove interactions with residues from PurR. It has been shown that lysine 55 of PurR makes a direct contact with the adenine base (Ade8) directly 5' to the central CpG base-pair step in the high-affinity purF operator sequence. We have investigated the importance of this interaction in the specificity and affinity of wild-type PurR (WT) for its operators and we have studied a mutant of PurR in which Lys55 is replaced with alanine (K55A). Complexes of WT and K55A with duplex DNA containing pur operator sequences varied at position 8 were investigated crystallographically, and binding studies were performed using fluorescence anisotropy. The structures of the protein-DNA complexes reveal a relatively unperturbed global conformation regardless of the identity of the base-pair at position 8 or residue 55. In all structures the combination of higher resolution and a palindromic purF operator site allowed several new PurR.DNA interactions to be observed, including contacts by Thr15, Thr16 and His20. The side-chain of Lys55 makes productive, though varying, interactions with the adenine, thymine or cytosine base at position 8 that result in equilibrium dissociation constants of 2.6 nM, 10 nM and 35 nM, respectively. However, the bulk of the lysine side-chain apparently blocks high-affinity binding of operators with guanine at position 8 (Kd620 nM). Also, the high-affinity binding conformation appears blocked, as crystals of WT bound to DNA with guanine at position 8 could not be grown. In complexes containing K55A, the alanine side-chain is too far removed to engage in van der Waals interactions with the operator, and, with the loss of the general electrostatic interaction between the phosphate backbone and the ammonium group of lysine, K55A binds each operator weakly. However, the mutation leads to a swap of specificity of PurR for the base at position 8, with K55A exhibiting a twofold preference for guanine over adenine. In addition to defining the role of Lys55 in PurR minor groove binding, these studies provide structural insight into the minor groove binding specificities of other LacI/GalR family members that have either alanine (e.g. LacI, GalR, CcpA) or a basic residue (e.g. RafR, ScrR, RbtR) at the comparable position.
Collapse
Affiliation(s)
- A Glasfeld
- Department of Biochemistry and Molecular Biology, Oregon Health Sciences University, Portland, OR, 97201-3098, USA
| | | | | | | |
Collapse
|
40
|
Nadassy K, Wodak SJ, Janin J. Structural features of protein-nucleic acid recognition sites. Biochemistry 1999; 38:1999-2017. [PMID: 10026283 DOI: 10.1021/bi982362d] [Citation(s) in RCA: 246] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
We analyzed the atomic models of 75 X-ray structures of protein-nucleic acid complexes with the aim of uncovering common properties. The interface area measured the extent of contact between the protein and nucleic acid. It was found to vary between 1120 and 5800 A2. Despite this wide variation, the interfaces in complexes of transcription factors with double-stranded DNA could be broken up into recognition modules where 12 +/- 3 nucleotides on the DNA side contact 24 +/- 6 amino acids on the protein side, with interface areas in the range 1600 +/- 400 A2. For enzymes acting on DNA, the recognition module is on average 600 A2 larger, due to the requirement of making an active site. As judged by its chemical and amino acid composition, the average protein surface in contact with the DNA is more polar than the solvent accessible surface or the typical protein-protein interface. The protein side is rich in positively charged groups from lysine and arginine side chains; on the DNA side the negative charges from phosphate groups dominate. Hydrogen bonding patterns were also analyzed, and we found one intermolecular hydrogen bond per 125 A2 of interface area in high-resolution structures. An equivalent number of polar interactions involved water molecules, which are generally abundant at protein-DNA interfaces. Calculations of Voronoi atomic volumes, performed in the presence and absence of water molecules, showed that protein atoms buried at the interface with DNA are on average as closely packed as in the protein interior. Water molecules contribute to the close packing, thereby mediating shape complementarity. Finally, conformational changes accompanying association were analyzed in 24 of the complexes for which the structure of the free protein was also available. On the DNA side the extent of deformation showed some correlation with the size of the interface area. On the protein side the type and size of the structural changes spanned a wide spectrum. Disorder-to-order transitions, domain movements, quaternary and tertiary changes were observed, and the largest changes occurred in complexes with large interfaces.
Collapse
Affiliation(s)
- K Nadassy
- European Bioinformatics Institute, EMBL, Wellcome Trust Genome Campus, Cambridge, England
| | | | | |
Collapse
|
41
|
Hars U, Horlacher R, Boos W, Welte W, Diederichs K. Crystal structure of the effector-binding domain of the trehalose-repressor of Escherichia coli, a member of the LacI family, in its complexes with inducer trehalose-6-phosphate and noninducer trehalose. Protein Sci 1998; 7:2511-21. [PMID: 9865945 PMCID: PMC2143882 DOI: 10.1002/pro.5560071204] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The crystal structure of the Escherichia coli trehalose repressor (TreR) in a complex with its inducer trehalose-6-phosphate was determined by the method of multiple isomorphous replacement (MIR) at 2.5 A resolution, followed by the structure determination of TreR in a complex with its noninducer trehalose at 3.1 A resolution. The model consists of residues 61 to 315 comprising the effector binding domain, which forms a dimer as in other members of the LacI family. This domain is composed of two similar subdomains each consisting of a central beta-sheet sandwiched between alpha-helices. The effector binding pocket is at the interface of these subdomains. In spite of different physiological functions, the crystal structures of the two complexes of TreR turned out to be virtually identical to each other with the conformation being similar to those of the effector binding domains of the LacI and PurR in complex with their effector molecules. According to the crystal structure, the noninducer trehalose binds to a similar site as the trehalose portion of trehalose-6-phosphate. The binding affinity for the former is lower than for the latter. The noninducer trehalose thus binds competitively to the repressor. Unlike the phosphorylated inducer molecule, it is incapable of blocking the binding of the repressor headpiece to its operator DNA. The ratio of the concentrations of trehalose-6-phosphate and trehalose thus is used to switch between the two alternative metabolic uses of trehalose as an osmoprotectant and as a carbon source.
Collapse
Affiliation(s)
- U Hars
- Department of Biology, University of Konstanz, Germany
| | | | | | | | | |
Collapse
|
42
|
Schumacher MA, Carter D, Scott DM, Roos DS, Ullman B, Brennan RG. Crystal structures of Toxoplasma gondii uracil phosphoribosyltransferase reveal the atomic basis of pyrimidine discrimination and prodrug binding. EMBO J 1998; 17:3219-32. [PMID: 9628859 PMCID: PMC1170660 DOI: 10.1093/emboj/17.12.3219] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Uracil phosphoribosyltransferase (UPRTase) catalyzes the transfer of a ribosyl phosphate group from alpha-D-5-phosphoribosyl-1-pyrophosphate to the N1 nitrogen of uracil. The UPRTase from the opportunistic pathogen Toxoplasma gondii is a rational target for antiparasitic drug design. To aid in structure-based drug design studies against toxoplasmosis, the crystal structures of the T.gondii apo UPRTase (1.93 A resolution), the UPRTase bound to its substrate, uracil (2.2 A resolution), its product, UMP (2.5 A resolution), and the prodrug, 5-fluorouracil (2.3 A resolution), have been determined. These structures reveal that UPRTase recognizes uracil through polypeptide backbone hydrogen bonds to the uracil exocyclic O2 and endocyclic N3 atoms and a backbone-water-exocyclic O4 oxygen hydrogen bond. This stereochemical arrangement and the architecture of the uracil-binding pocket reveal why cytosine and pyrimidines with exocyclic substituents at ring position 5 larger than fluorine, including thymine, cannot bind to the enzyme. Strikingly, the T. gondii UPRTase contains a 22 residue insertion within the conserved PRTase fold that forms an extended antiparallel beta-arm. Leu92, at the tip of this arm, functions to cap the active site of its dimer mate, thereby inhibiting the escape of the substrate-binding water molecule.
Collapse
Affiliation(s)
- M A Schumacher
- Department of Biochemistry and Molecular Biology, Oregon Health Sciences University, Portland, OR 97201-3098, USA
| | | | | | | | | | | |
Collapse
|
43
|
Arvidson DN, Lu F, Faber C, Zalkin H, Brennan RG. The structure of PurR mutant L54M shows an alternative route to DNA kinking. NATURE STRUCTURAL BIOLOGY 1998; 5:436-41. [PMID: 9628480 DOI: 10.1038/nsb0698-436] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The crystal structure of the purine repressor mutant L54M bound to hypoxanthine and to the purF operator provides a stereochemical understanding of the high DNA affinity of this hinge helix mutant. Comparison of the PurR L54M-DNA complex to that of the wild type PurR-DNA complex reveals that these purine repressors bind and kink DNA similarly despite significant differences in their minor groove contacts and routes to interdigitation of the central C.G:G.C base pair step. Modeling studies, supported by genetic and biochemical data, show that the stereochemistry of the backbone atoms of the abutting hinge helices combined with the rigidity of the kinked base pair step constrain the interdigitating residue to leucine or methionine for the LacI/GalR family of transcription regulators.
Collapse
Affiliation(s)
- D N Arvidson
- Department of Biochemistry and Molecular Biology, Oregon Health Sciences University, Portland 97201-3098, USA
| | | | | | | | | |
Collapse
|
44
|
Lu F, Schumacher MA, Arvidson DN, Haldimann A, Wanner BL, Zalkin H, Brennan RG. Structure-based redesign of corepressor specificity of the Escherichia coli purine repressor by substitution of residue 190. Biochemistry 1998; 37:971-82. [PMID: 9454587 DOI: 10.1021/bi971942s] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Guanine or hypoxanthine, physiological corepressors of the Escherichia coli purine repressor (PurR), promote formation of the ternary PurR-corepressor-operator DNA complex that functions to repress pur operon gene expression. Structure-based predictions on the importance of Arg190 in determining 6-oxopurine specificity and corepressor binding affinity were tested by mutagenesis, analysis of in vivo function, and in vitro corepressor binding measurements. Replacements of Arg190 with Ala or Gln resulted in functional repressors in which binding of guanine and hypoxanthine was retained but specificity was relaxed to permit binding of adenine. X-ray structures were determined for ternary complexes of mutant repressors with purines (adenine, guanine, hypoxanthine, and 6-methylpurine) and operator DNA. These structures indicate that R190A binds guanine, hypoxanthine, and adenine with nearly equal, albeit reduced, affinity in large part because of a newly made compensatory hydrogen bond between the rotated hydroxyl side chain of Ser124 and the exocyclic 6 positions of the purines. Through direct and water-mediated contacts, the R190Q protein binds adenine with a nearly 75-fold higher affinity than the wild type repressor while maintaining wild type affinity for guanine and hypoxanthine. The results establish at the atomic level the basis for the critical role of Arg190 in the recognition of the exocyclic 6 position of its purine corepressors and the successful redesign of corepressor specificity.
Collapse
Affiliation(s)
- F Lu
- Department of Biochemistry, Purdue University, West Lafayette, Indiana 47907-1153, USA
| | | | | | | | | | | | | |
Collapse
|