1
|
O'Neil PT, Swint-Kruse L, Fenton AW. Rheostatic contributions to protein stability can obscure a position's functional role. Protein Sci 2024; 33:e5075. [PMID: 38895978 DOI: 10.1002/pro.5075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/24/2024] [Accepted: 05/27/2024] [Indexed: 06/21/2024]
Abstract
Rheostat positions, which can be substituted with various amino acids to tune protein function across a range of outcomes, are a developing area for advancing personalized medicine and bioengineering. Current methods cannot accurately predict which proteins contain rheostat positions or their substitution outcomes. To compare the prevalence of rheostat positions in homologs, we previously investigated their occurrence in two pyruvate kinase (PYK) isozymes. Human liver PYK contained numerous rheostat positions that tuned the apparent affinity for the substrate phosphoenolpyruvate (Kapp-PEP) across a wide range. In contrast, no functional rheostat positions were identified in Zymomonas mobilis PYK (ZmPYK). Further, the set of ZmPYK substitutions included an unusually large number that lacked measurable activity. We hypothesized that the inactive substitution variants had reduced protein stability, precluding detection of Kapp-PEP tuning. Using modified buffers, robust enzymatic activity was obtained for 19 previously-inactive ZmPYK substitution variants at three positions. Surprisingly, both previously-inactive and previously-active substitution variants all had Kapp-PEP values close to wild-type. Thus, none of the three positions were functional rheostat positions, and, unlike human liver PYK, ZmPYK's Kapp-PEP remained poorly tunable by single substitutions. To directly assess effects on stability, we performed thermal denaturation experiments for all ZmPYK substitution variants. Many diminished stability, two enhanced stability, and the three positions showed different thermal sensitivity to substitution, with one position acting as a "stability rheostat." The differences between the two PYK homologs raises interesting questions about the underlying mechanism(s) that permit functional tuning by single substitutions in some proteins but not in others.
Collapse
Affiliation(s)
- Pierce T O'Neil
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas, USA
| |
Collapse
|
2
|
Sreenivasan S, Heffren P, Suh K, Rodnin MV, Kosa E, Fenton AW, Ladokhin AS, Smith PE, Fontes JD, Swint‐Kruse L. The intrinsically disordered transcriptional activation domain of CIITA is functionally tuneable by single substitutions: An exception or a new paradigm? Protein Sci 2024; 33:e4863. [PMID: 38073129 PMCID: PMC10806935 DOI: 10.1002/pro.4863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 12/04/2023] [Accepted: 12/07/2023] [Indexed: 01/27/2024]
Abstract
During protein evolution, some amino acid substitutions modulate protein function ("tuneability"). In most proteins, the tuneable range is wide and can be sampled by a set of protein variants that each contains multiple amino acid substitutions. In other proteins, the full tuneable range can be accessed by a set of variants that each contains a single substitution. Indeed, in some globular proteins, the full tuneable range can be accessed by the set of site-saturating substitutions at an individual "rheostat" position. However, in proteins with intrinsically disordered regions (IDRs), most functional studies-which would also detect tuneability-used multiple substitutions or small deletions. In disordered transcriptional activation domains (ADs), studies with multiple substitutions led to the "acidic exposure" model, which does not anticipate the existence of rheostat positions. In the few studies that did assess effects of single substitutions on AD function, results were mixed: the ADs of two full-length transcription factors did not show tuneability, whereas a fragment of a third AD was tuneable by single substitutions. In this study, we tested tuneability in the AD of full-length human class II transactivator (CIITA). Sequence analyses and experiments showed that CIITA's AD is an IDR. Functional assays of singly-substituted AD variants showed that CIITA's function was highly tuneable, with outcomes not predicted by the acidic exposure model. Four tested positions showed rheostat behavior for transcriptional activation. Thus, tuneability of different IDRs can vary widely. Future studies are needed to illuminate the biophysical features that govern whether an IDR is tuneable by single substitutions.
Collapse
Affiliation(s)
- Shwetha Sreenivasan
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Paul Heffren
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
- Present address:
Department of BiosciencesKansas City UniversityKansas CityMissouriUSA
| | - Kyung‐Shin Suh
- Department of ChemistryKansas State UniversityManhattanKansasUSA
| | - Mykola V. Rodnin
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Edina Kosa
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Aron W. Fenton
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Alexey S. Ladokhin
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Paul E. Smith
- Department of ChemistryKansas State UniversityManhattanKansasUSA
| | - Joseph D. Fontes
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| | - Liskin Swint‐Kruse
- Department of Biochemistry and Molecular BiologyUniversity of Kansas Medical CenterKansas CityKansasUSA
| |
Collapse
|
3
|
Alexandari AM, Horton CA, Shrikumar A, Shah N, Li E, Weilert M, Pufall MA, Zeitlinger J, Fordyce PM, Kundaje A. De novo distillation of thermodynamic affinity from deep learning regulatory sequence models of in vivo protein-DNA binding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.11.540401. [PMID: 37214836 PMCID: PMC10197627 DOI: 10.1101/2023.05.11.540401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Transcription factors (TF) are proteins that bind DNA in a sequence-specific manner to regulate gene transcription. Despite their unique intrinsic sequence preferences, in vivo genomic occupancy profiles of TFs differ across cellular contexts. Hence, deciphering the sequence determinants of TF binding, both intrinsic and context-specific, is essential to understand gene regulation and the impact of regulatory, non-coding genetic variation. Biophysical models trained on in vitro TF binding assays can estimate intrinsic affinity landscapes and predict occupancy based on TF concentration and affinity. However, these models cannot adequately explain context-specific, in vivo binding profiles. Conversely, deep learning models, trained on in vivo TF binding assays, effectively predict and explain genomic occupancy profiles as a function of complex regulatory sequence syntax, albeit without a clear biophysical interpretation. To reconcile these complementary models of in vitro and in vivo TF binding, we developed Affinity Distillation (AD), a method that extracts thermodynamic affinities de-novo from deep learning models of TF chromatin immunoprecipitation (ChIP) experiments by marginalizing away the influence of genomic sequence context. Applied to neural networks modeling diverse classes of yeast and mammalian TFs, AD predicts energetic impacts of sequence variation within and surrounding motifs on TF binding as measured by diverse in vitro assays with superior dynamic range and accuracy compared to motif-based methods. Furthermore, AD can accurately discern affinities of TF paralogs. Our results highlight thermodynamic affinity as a key determinant of in vivo binding, suggest that deep learning models of in vivo binding implicitly learn high-resolution affinity landscapes, and show that these affinities can be successfully distilled using AD. This new biophysical interpretation of deep learning models enables high-throughput in silico experiments to explore the influence of sequence context and variation on both intrinsic affinity and in vivo occupancy.
Collapse
Affiliation(s)
- Amr M. Alexandari
- Department of Computer Science, Stanford University, Stanford, CA 94305
| | | | - Avanti Shrikumar
- Department of Earth System Science, Stanford University, Stanford, CA 94305
| | - Nilay Shah
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Eileen Li
- Department of Genetics, Stanford University, Stanford, CA 94305
| | - Melanie Weilert
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | - Miles A. Pufall
- Department of Biochemistry, Carver College of Medicine, University of Iowa, Iowa City, Iowa 52242, USA
| | - Julia Zeitlinger
- Stowers Institute for Medical Research, Kansas City, MO, USA
- The University of Kansas Medical Center, Kansas City, KS, USA
| | - Polly M. Fordyce
- Department of Genetics, Stanford University, Stanford, CA 94305
- Department of Bioengineering, Stanford University, Stanford, CA 94305
- ChEM-H Institute, Stanford University, Stanford, CA 94305
- Chan Zuckerberg Biohub, San Francisco, CA 94110
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA 94305
- Department of Genetics, Stanford University, Stanford, CA 94305
| |
Collapse
|
4
|
Swint-Kruse L, Dougherty LL, Page B, Wu T, O’Neil PT, Prasannan CB, Timmons C, Tang Q, Parente DJ, Sreenivasan S, Holyoak T, Fenton AW. PYK-SubstitutionOME: an integrated database containing allosteric coupling, ligand affinity and mutational, structural, pathological, bioinformatic and computational information about pyruvate kinase isozymes. Database (Oxford) 2023; 2023:baad030. [PMID: 37171062 PMCID: PMC10176505 DOI: 10.1093/database/baad030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 03/29/2023] [Accepted: 04/11/2023] [Indexed: 05/13/2023]
Abstract
Interpreting changes in patient genomes, understanding how viruses evolve and engineering novel protein function all depend on accurately predicting the functional outcomes that arise from amino acid substitutions. To that end, the development of first-generation prediction algorithms was guided by historic experimental datasets. However, these datasets were heavily biased toward substitutions at positions that have not changed much throughout evolution (i.e. conserved). Although newer datasets include substitutions at positions that span a range of evolutionary conservation scores, these data are largely derived from assays that agglomerate multiple aspects of function. To facilitate predictions from the foundational chemical properties of proteins, large substitution databases with biochemical characterizations of function are needed. We report here a database derived from mutational, biochemical, bioinformatic, structural, pathological and computational studies of a highly studied protein family-pyruvate kinase (PYK). A centerpiece of this database is the biochemical characterization-including quantitative evaluation of allosteric regulation-of the changes that accompany substitutions at positions that sample the full conservation range observed in the PYK family. We have used these data to facilitate critical advances in the foundational studies of allosteric regulation and protein evolution and as rigorous benchmarks for testing protein predictions. We trust that the collected dataset will be useful for the broader scientific community in the further development of prediction algorithms. Database URL https://github.com/djparente/PYK-DB.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Larissa L Dougherty
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Braelyn Page
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Tiffany Wu
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Pierce T O’Neil
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Charulata B Prasannan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Cody Timmons
- Chemistry Department, Southwestern Oklahoma State University, 100 Campus Dr., Weatherford, OK 73096, USA
| | - Qingling Tang
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Daniel J Parente
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
- Department of Family Medicine and Community Health, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Shwetha Sreenivasan
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| | - Todd Holyoak
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
- Department of Biology, University of Waterloo, 200 University Ave. W, Waterloo, ON N2L 3G1, Canada
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd., Kansas City, KS 66160, USA
| |
Collapse
|
5
|
Page BM, Martin TA, Wright CL, Fenton LA, Villar MT, Tang Q, Artigues A, Lamb A, Fenton AW, Swint-Kruse L. Odd one out? Functional tuning of Zymomonas mobilis pyruvate kinase is narrower than its allosteric, human counterpart. Protein Sci 2022; 31:e4336. [PMID: 35762709 DOI: 10.1002/pro.4336] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 04/29/2022] [Accepted: 05/03/2022] [Indexed: 11/08/2022]
Abstract
Various protein properties are often illuminated using sequence comparisons of protein homologs. For example, in analyses of the pyruvate kinase multiple sequence alignment, the set of positions that changed during speciation ("phylogenetic" positions) were enriched for "rheostat" positions in human liver pyruvate kinase (hLPYK). (Rheostat positions are those which, when substituted with various amino acids, yield a range of functional outcomes). However, the correlation was moderate, which could result from multiple biophysical constraints acting on the same position during evolution and/or various sources of noise. To further examine this correlation, we here tested Zymomonas mobilis PYK (ZmPYK), which has <65% sequence identity to any other PYK sequence. Twenty-six ZmPYK positions were selected based on their phylogenetic scores, substituted with multiple amino acids, and assessed for changes in Kapp-PEP . Although we expected to identify multiple, strong rheostat positions, only one moderate rheostat position was detected. Instead, nearly half of the 271 ZmPYK variants were inactive and most others showed near wild-type function. Indeed, for the active ZmPYK variants, the total range of Kapp,PEP values ("tunability") was 40-fold less than that observed for hLPYK variants. The combined functional studies and sequence comparisons suggest that ZmPYK has evolved functional and/or structural attributes that differ from the rest of the family. We hypothesize that including such "orphan" sequences in MSA analyses obscures the correlations used to predict rheostat positions. Finally, results raise the intriguing biophysical question as to how the same protein fold can support rheostat positions in one homolog but not another.
Collapse
Affiliation(s)
- Braelyn M Page
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Tyler A Martin
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Collette L Wright
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.,Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, USA
| | - Lauren A Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Maite T Villar
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Qingling Tang
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Antonio Artigues
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Audrey Lamb
- Department of Molecular Biosciences, The University of Kansas, Lawrence, Kansas, USA.,Department of Chemistry, University of Texas at San Antonio, San Antonio, Texas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
6
|
Ruggiero MJ, Malhotra S, Fenton AW, Swint-Kruse L, Karanicolas J, Hagenbuch B. Structural Plasticity Is a Feature of Rheostat Positions in the Human Na +/Taurocholate Cotransporting Polypeptide (NTCP). Int J Mol Sci 2022; 23:ijms23063211. [PMID: 35328632 PMCID: PMC8954283 DOI: 10.3390/ijms23063211] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 03/11/2022] [Accepted: 03/15/2022] [Indexed: 02/05/2023] Open
Abstract
In the Na+/taurocholate cotransporting polypeptide (NTCP), the clinically relevant S267F polymorphism occurs at a "rheostat position". That is, amino acid substitutions at this position ("S267X") lead to a wide range of functional outcomes. This result was particularly striking because molecular models predicted the S267X side chains are buried, and thus, usually expected to be less tolerant of substitutions. To assess whether structural tolerance to buried substitutions is widespread in NTCP, here we used Rosetta to model all 19 potential substitutions at another 13 buried positions. Again, only subtle changes in the calculated stabilities and structures were predicted. Calculations were experimentally validated for 19 variants at codon 271 ("N271X"). Results showed near wildtype expression and rheostatic modulation of substrate transport, implicating N271 as a rheostat position. Notably, each N271X substitution showed a similar effect on the transport of three different substrates and thus did not alter substrate specificity. This differs from S267X, which altered both transport kinetics and specificity. As both transport and specificity may change during protein evolution, the recognition of such rheostat positions may be important for evolutionary studies. We further propose that the presence of rheostat positions is facilitated by local plasticity within the protein structure. Finally, we note that identifying rheostat positions may advance efforts to predict new biomedically relevant missense variants in NTCP and other membrane transport proteins.
Collapse
Affiliation(s)
- Melissa J. Ruggiero
- Department of Pharmacology, Toxicology and Therapeutics, The University of Kansas Medical Center, Kansas City, KS 66160, USA;
| | - Shipra Malhotra
- Program in Molecular Therapeutics, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111, USA; (S.M.); (J.K.)
| | - Aron W. Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA; (A.W.F.); (L.S.-K.)
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, KS 66160, USA; (A.W.F.); (L.S.-K.)
| | - John Karanicolas
- Program in Molecular Therapeutics, Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA 19111, USA; (S.M.); (J.K.)
| | - Bruno Hagenbuch
- Department of Pharmacology, Toxicology and Therapeutics, The University of Kansas Medical Center, Kansas City, KS 66160, USA;
- Correspondence:
| |
Collapse
|
7
|
Fenton KD, Meneely KM, Wu T, Martin TA, Swint‐Kruse L, Fenton AW, Lamb AL. Substitutions at a rheostat position in human aldolase A cause a shift in the conformational population. Protein Sci 2022; 31:357-370. [PMID: 34734672 PMCID: PMC8819835 DOI: 10.1002/pro.4222] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2021] [Revised: 10/28/2021] [Accepted: 10/28/2021] [Indexed: 02/03/2023]
Abstract
Some protein positions play special roles in determining the magnitude of protein function: at such "rheostat" positions, varied amino acid substitutions give rise to a continuum of functional outcomes, from wild type (or enhanced), to intermediate, to loss of function. This observed range raises interesting questions about the biophysical bases by which changes at single positions have such varied outcomes. Here, we assessed variants at position 98 in human aldolase A ("I98X"). Despite being ~17 Å from the active site and far from subunit interfaces, substitutions at position 98 have rheostatic contributions to the apparent cooperativity (nH ) associated with fructose-1,6-bisphosphate substrate binding and moderately affected binding affinity. Next, we crystallized representative I98X variants to assess structural consequences. Residues smaller than the native isoleucine (cysteine and serine) were readily accommodated, and the larger phenylalanine caused only a slight separation of the two parallel helixes. However, the diffraction quality was reduced for I98F, and further reduced for I98Y. Intriguingly, the resolutions of the I98X structures correlated with their nH values. We propose that substitution effects on both nH and crystal lattice disruption arise from changes in the population of aldolase A conformations in solution. In combination with results computed for rheostat positions in other proteins, the results from this study suggest that rheostat positions accommodate a wide range of side chains and that structural consequences manifest as shifted ensemble populations and/or dynamics changes.
Collapse
Affiliation(s)
- Kathryn D. Fenton
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Kathleen M. Meneely
- Department of ChemistryUniversity of Texas at San AntonioSan AntonioTexasUSA
| | - Tiffany Wu
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Tyler A. Martin
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Liskin Swint‐Kruse
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Aron W. Fenton
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Audrey L. Lamb
- Department of ChemistryUniversity of Texas at San AntonioSan AntonioTexasUSA
| |
Collapse
|
8
|
McCormick JW, Russo MA, Thompson S, Blevins A, Reynolds KA. Structurally distributed surface sites tune allosteric regulation. eLife 2021; 10:68346. [PMID: 34132193 PMCID: PMC8324303 DOI: 10.7554/elife.68346] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 06/15/2021] [Indexed: 11/30/2022] Open
Abstract
Our ability to rationally optimize allosteric regulation is limited by incomplete knowledge of the mutations that tune allostery. Are these mutations few or abundant, structurally localized or distributed? To examine this, we conducted saturation mutagenesis of a synthetic allosteric switch in which Dihydrofolate reductase (DHFR) is regulated by a blue-light sensitive LOV2 domain. Using a high-throughput assay wherein DHFR catalytic activity is coupled to E. coli growth, we assessed the impact of 1548 viable DHFR single mutations on allostery. Despite most mutations being deleterious to activity, fewer than 5% of mutations had a statistically significant influence on allostery. Most allostery disrupting mutations were proximal to the LOV2 insertion site. In contrast, allostery enhancing mutations were structurally distributed and enriched on the protein surface. Combining several allostery enhancing mutations yielded near-additive improvements to dynamic range. Our results indicate a path toward optimizing allosteric function through variation at surface sites. Many proteins exhibit a property called ‘allostery’. In allostery, an input signal at a specific site of a protein – such as a molecule binding, or the protein absorbing a photon of light – leads to a change in output at another site far away. For example, the protein might catalyze a chemical reaction faster or bind to another molecule more tightly in the presence of the input signal. This protein ‘remote control’ allows cells to sense and respond to changes in their environment. An ability to rapidly engineer new allosteric mechanisms into proteins is much sought after because this would provide an approach for building biosensors and other useful tools. One common approach to engineering new allosteric regulation is to combine a ‘sensor’ or input region from one protein with an ‘output’ region or domain from another. When researchers engineer allostery using this approach of combining input and output domains from different proteins, the difference in the output when the input is ‘on’ versus ‘off’ is often small, a situation called ‘modest allostery’. McCormick et al. wanted to know how to optimize this domain combination approach to increase the difference in output between the ‘on’ and ‘off’ states. More specifically, McCormick et al. wanted to find out whether swapping out or mutating specific amino acids (each of the individual building blocks that make up a protein) enhances or disrupts allostery. They also wanted to know if there are many possible mutations that change the effectiveness of allostery, or if this property is controlled by just a few amino acids. Finally, McCormick et al. questioned where in a protein most of these allostery-tuning mutations were located. To answer these questions, McCormick et al. engineered a new allosteric protein by inserting a light-sensing domain (input) into a protein involved in metabolism (a metabolic enzyme that produces a biomolecule called a tetrahydrofolate) to yield a light-controlled enzyme. Next, they introduced mutations into both the ‘input’ and ‘output’ domains to see where they had a greater effect on allostery. After filtering out mutations that destroyed the function of the output domain, McCormick et al. found that only about 5% of mutations to the ‘output’ domain altered the allosteric response of their engineered enzyme. In fact, most mutations that disrupted allostery were found near the site where the ‘input’ domain was inserted, while mutations that enhanced allostery were sprinkled throughout the enzyme, often on its protein surface. This was surprising in light of the commonly-held assumption that mutations on protein surfaces have little impact on the activity of the ‘output’ domain. Overall, the effect of individual mutations on allostery was small, but McCormick et al. found that these mutations can sometimes be combined to yield larger effects. McCormick et al.’s results suggest a new approach for optimizing engineered allosteric proteins: by introducing mutations on the protein surface. It also opens up new questions: mechanically, how do surface sites affect allostery? In the future, it will be important to characterize how combinations of mutations can optimize allosteric regulation, and to determine what evolutionary trajectories to high performance allosteric ‘switches’ look like.
Collapse
Affiliation(s)
- James W McCormick
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| | - Marielle Ax Russo
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| | - Samuel Thompson
- Department of Bioengineering, Stanford University, Stanford, United States
| | - Aubrie Blevins
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States
| | - Kimberly A Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| |
Collapse
|
9
|
Swint-Kruse L, Martin TA, Page BM, Wu T, Gerhart PM, Dougherty LL, Tang Q, Parente DJ, Mosier BR, Bantis LE, Fenton AW. Rheostat functional outcomes occur when substitutions are introduced at nonconserved positions that diverge with speciation. Protein Sci 2021; 30:1833-1853. [PMID: 34076313 DOI: 10.1002/pro.4136] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 05/25/2021] [Accepted: 05/28/2021] [Indexed: 12/14/2022]
Abstract
When amino acids vary during evolution, the outcome can be functionally neutral or biologically-important. We previously found that substituting a subset of nonconserved positions, "rheostat" positions, can have surprising effects on protein function. Since changes at rheostat positions can facilitate functional evolution or cause disease, more examples are needed to understand their unique biophysical characteristics. Here, we explored whether "phylogenetic" patterns of change in multiple sequence alignments (such as positions with subfamily specific conservation) predict the locations of functional rheostat positions. To that end, we experimentally tested eight phylogenetic positions in human liver pyruvate kinase (hLPYK), using 10-15 substitutions per position and biochemical assays that yielded five functional parameters. Five positions were strongly rheostatic and three were non-neutral. To test the corollary that positions with low phylogenetic scores were not rheostat positions, we combined these phylogenetic positions with previously-identified hLPYK rheostat, "toggle" (most substitution abolished function), and "neutral" (all substitutions were like wild-type) positions. Despite representing 428 variants, this set of 33 positions was poorly statistically powered. Thus, we turned to the in vivo phenotypic dataset for E. coli lactose repressor protein (LacI), which comprised 12-13 substitutions at 329 positions and could be used to identify rheostat, toggle, and neutral positions. Combined hLPYK and LacI results show that positions with strong phylogenetic patterns of change are more likely to exhibit rheostat substitution outcomes than neutral or toggle outcomes. Furthermore, phylogenetic patterns were more successful at identifying rheostat positions than were co-evolutionary or eigenvector centrality measures of evolutionary change.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Tyler A Martin
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Braelyn M Page
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Tiffany Wu
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Paige M Gerhart
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Larissa L Dougherty
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.,Department of Biochemistry and Cell Biology, Geisel School of Medicine at Dartmouth College, Hanover, New Hampshire, USA
| | - Qingling Tang
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Daniel J Parente
- Department of Family Medicine and Community Health, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Brian R Mosier
- Department of Biostatistics and Data Science, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Leonidas E Bantis
- Department of Biostatistics and Data Science, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
10
|
Ruggiero MJ, Malhotra S, Fenton AW, Swint-Kruse L, Karanicolas J, Hagenbuch B. A clinically relevant polymorphism in the Na +/taurocholate cotransporting polypeptide (NTCP) occurs at a rheostat position. J Biol Chem 2020; 296:100047. [PMID: 33168628 PMCID: PMC7948949 DOI: 10.1074/jbc.ra120.014889] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 10/22/2020] [Accepted: 11/09/2020] [Indexed: 12/28/2022] Open
Abstract
Conventionally, most amino acid substitutions at “important” protein positions are expected to abolish function. However, in several soluble-globular proteins, we identified a class of nonconserved positions for which various substitutions produced progressive functional changes; we consider these evolutionary “rheostats”. Here, we report a strong rheostat position in the integral membrane protein, Na+/taurocholate (TCA) cotransporting polypeptide, at the site of a pharmacologically relevant polymorphism (S267F). Functional studies were performed for all 20 substitutions (S267X) with three substrates (TCA, estrone-3-sulfate, and rosuvastatin). The S267X set showed strong rheostatic effects on overall transport, and individual substitutions showed varied effects on transport kinetics (Km and Vmax) and substrate specificity. To assess protein stability, we measured surface expression and used the Rosetta software (https://www.rosettacommons.org) suite to model structure and stability changes of S267X. Although buried near the substrate-binding site, S267X substitutions were easily accommodated in the Na+/TCA cotransporting polypeptide structure model. Across the modest range of changes, calculated stabilities correlated with surface-expression differences, but neither parameter correlated with altered transport. Thus, substitutions at rheostat position 267 had wide-ranging effects on the phenotype of this integral membrane protein. We further propose that polymorphic positions in other proteins might be locations of rheostat positions.
Collapse
Affiliation(s)
- Melissa J Ruggiero
- Department of Pharmacology, Toxicology and Therapeutics, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Shipra Malhotra
- Program in Molecular Therapeutics, Fox Chase Cancer Center, Philadelphia, Pennsylvania, USA; Center for Computational Biology, University of Kansas, Lawrence, Kansas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - John Karanicolas
- Program in Molecular Therapeutics, Fox Chase Cancer Center, Philadelphia, Pennsylvania, USA
| | - Bruno Hagenbuch
- Department of Pharmacology, Toxicology and Therapeutics, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| |
Collapse
|
11
|
Martin TA, Wu T, Tang Q, Dougherty LL, Parente DJ, Swint-Kruse L, Fenton AW. Identification of biochemically neutral positions in liver pyruvate kinase. Proteins 2020; 88:1340-1350. [PMID: 32449829 DOI: 10.1002/prot.25953] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 03/10/2020] [Accepted: 05/16/2020] [Indexed: 01/08/2023]
Abstract
Understanding how each residue position contributes to protein function has been a long-standing goal in protein science. Substitution studies have historically focused on conserved protein positions. However, substitutions of nonconserved positions can also modify function. Indeed, we recently identified nonconserved positions that have large substitution effects in human liver pyruvate kinase (hLPYK), including altered allosteric coupling. To facilitate a comparison of which characteristics determine when a nonconserved position does vs does not contribute to function, the goal of the current work was to identify neutral positions in hLPYK. However, existing hLPYK data showed that three features commonly associated with neutral positions-high sequence entropy, high surface exposure, and alanine scanning-lacked the sensitivity needed to guide experimental studies. We used multiple evolutionary patterns identified in a sequence alignment of the PYK family to identify which positions were least patterned, reasoning that these were most likely to be neutral. Nine positions were tested with a total of 117 amino acid substitutions. Although exploring all potential functions is not feasible for any protein, five parameters associated with substrate/effector affinities and allosteric coupling were measured for hLPYK variants. For each position, the aggregate functional outcomes of all variants were used to quantify a "neutrality" score. Three positions showed perfect neutral scores for all five parameters. Furthermore, the nine positions showed larger neutral scores than 17 positions located near allosteric binding sites. Thus, our strategy successfully enriched the dataset for positions with neutral and modest substitutions.
Collapse
Affiliation(s)
- Tyler A Martin
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Tiffany Wu
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Qingling Tang
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Larissa L Dougherty
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Daniel J Parente
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.,Department of Family and Community Medicine, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
12
|
Abstract
To achieve the full potential of pharmacogenomics, one must accurately predict the functional outcomes that arise from amino acid substitutions in proteins. Classically, researchers have focused on understanding the consequences of individual substitutions. However, literature surveys have shown that most substitutions were created at evolutionarily conserved positions. Awareness of this bias leads to a shift in perspective, from considering the outcomes of individual substitutions to understanding the roles of individual protein positions. Conserved positions tend to act as “toggle” switches, with most substitutions abolishing function. However, nonconserved positions have been found equally capable of affecting protein function. Indeed, many nonconserved positions act like functional dimmer switches (“rheostat” positions): this is revealed when multiple substitutions are made at a single position. Each substitution has a different functional outcome; the set of substitutions spans a range of outcomes. Finally, some nonconserved positions appear neutral, capable of accommodating all amino acid types without modifying function. This paper reviews the currently-known properties of rheostat positions, with examples shown for pyruvate kinase, organic anion transporting polypeptide 1B1, the beta-lactamase inhibitory protein, and angiotensin-converting enzyme 2. Outcomes observed for rheostat positions have implications for the rational design of drug analogs and allosteric drugs. Furthermore, this new framework—comprising three types of protein positions—provides a new approach to interpreting disease and population-based databases of amino acid changes. In conclusion, although a full understanding of substitution outcomes at rheostat positions poses a challenge, utilization of this new frame of reference will further advance the application of pharmacogenomics.
Collapse
|