1
|
Sternke M, Tripp KW, Barrick D. The use of consensus sequence information to engineer stability and activity in proteins. Methods Enzymol 2020; 643:149-179. [PMID: 32896279 DOI: 10.1016/bs.mie.2020.06.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The goal of protein design is to create proteins that are stable, soluble, and active. Here we focus on one approach to protein design in which sequence information is used to create a "consensus" sequence. Such consensus sequences comprise the most common residue at each position in a multiple sequence alignment (MSA). After describing some general ideas that relate MSA and consensus sequences and presenting a statistical thermodynamic framework that relates consensus and non-consensus sequences to stability, we detail the process of designing a consensus sequence and survey reports of consensus design and characterization from the literature. Many of these consensus proteins retain native biological activities including ligand binding and enzyme activity. Remarkably, in most cases the consensus protein shows significantly higher stability than extant versions of the protein, as measured by thermal or chemical denaturation, consistent with the statistical thermodynamic model. To understand this stability increase, we compare various features of consensus sequences with the extant MSA sequences from which they were derived. Consensus sequences show enrichment in charged residues (most notably glutamate and lysine) and depletion of uncharged polar residues (glutamine, serine, and asparagine). Surprisingly, a survey of stability changes resulting from point substitutions show little correlation with residue frequencies at the corresponding positions within the MSA, suggesting that the high stability of consensus proteins may result from interactions among residue pairs or higher-order clusters. Whatever the source, the large number of reported successes demonstrates that consensus design is a viable route to generating active and in many cases highly stabilized proteins.
Collapse
Affiliation(s)
- Matt Sternke
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, United States; Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, United States
| | - Katherine W Tripp
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, United States
| | - Doug Barrick
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD, United States.
| |
Collapse
|
2
|
Klein SA, Majumdar A, Barrick D. A Second Backbone: The Contribution of a Buried Asparagine Ladder to the Global and Local Stability of a Leucine-Rich Repeat Protein. Biochemistry 2019; 58:3480-3493. [PMID: 31347358 PMCID: PMC7184636 DOI: 10.1021/acs.biochem.9b00355] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Parallel β-sheet-containing repeat proteins often display a structural motif in which conserved asparagines form a continuous ladder buried within the hydrophobic core. In such "asparagine ladders", the asparagine side-chain amides form a repetitive pattern of hydrogen bonds with neighboring main-chain NH and CO groups. Although asparagine ladders have been thought to be important for stability, there is little experimental evidence to support such speculation. Here we test the contribution of a minimal asparagine ladder from the leucine-rich repeat protein pp32 to stability and investigate lattice rigidity and hydrogen bond character using solution nuclear magnetic resonance (NMR) spectroscopy. Point substitutions of the two ladder asparagines of pp32 are strongly destabilizing and decrease the cooperativity of unfolding. The chemical shifts of the ladder side-chain HZ protons are shifted significantly downfield in the NMR spectrum and have low temperature coefficients, indicative of strong hydrogen bonding. In contrast, the HE protons are shifted upfield and have temperature coefficients close to zero, suggesting an asymmetry in hydrogen bond strength along the ladder. Ladder NH2 groups have weak 1H-15N cross-peak intensities; 1H-15N nuclear Overhauser effect and 15N CPMG experiments show this to be the result of high rigidity. Hydrogen exchange measurements demonstrate that the ladder NH2 groups exchange very slowly, with rates approaching the global exchange limit. Overall, these results show that the asparagine side chains are held in a very rigid, nondynamic structure, making a significant contribution to the overall stability. In this regard, buried asparagine ladders can be considered "second backbones" within the cores of their elongated β-sheet host proteins.
Collapse
Affiliation(s)
- Sean A. Klein
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218 USA
| | - Ananya Majumdar
- The Johns Hopkins University Biomolecular NMR Center, Johns Hopkins University, Baltimore, Maryland, 21218
| | - Doug Barrick
- T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218 USA
| |
Collapse
|
3
|
Goyal VD, Sullivan BJ, Magliery TJ. Phylogenetic spread of sequence data affects fitness of consensus enzymes: Insights from triosephosphate isomerase. Proteins 2019; 88:274-283. [PMID: 31407418 DOI: 10.1002/prot.25799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 07/26/2019] [Accepted: 08/08/2019] [Indexed: 11/08/2022]
Abstract
The concept of consensus in multiple sequence alignments (MSAs) has been used to design and engineer proteins previously with some success. However, consensus design implicitly assumes that all amino acid positions function independently, whereas in reality, the amino acids in a protein interact with each other and work cooperatively to produce the optimum structure required for its function. Correlation analysis is a tool that can capture the effect of such interactions. In a previously published study, we made consensus variants of the triosephosphate isomerase (TIM) protein using MSAs that included sequences form both prokaryotic and eukaryotic organisms. These variants were not completely native-like and were also surprisingly different from each other in terms of oligomeric state, structural dynamics, and activity. Extensive correlation analysis of the TIM database has revealed some clues about factors leading to the unusual behavior of the previously constructed consensus proteins. Among other things, we have found that the more ill-behaved consensus mutant had more broken correlations than the better-behaved consensus variant. Moreover, we report three correlation and phylogeny-based consensus variants of TIM. These variants were more native-like than the previous consensus mutants and considerably more stable than a wild-type TIM from a mesophilic organism. This study highlights the importance of choosing the appropriate diversity of MSA for consensus analysis and provides information that can be used to engineer stable enzymes.
Collapse
Affiliation(s)
- Venuka Durani Goyal
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio
| | - Brandon J Sullivan
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio.,Ohio State Biochemistry Program, The Ohio State University, Columbus, Ohio
| | - Thomas J Magliery
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio
| |
Collapse
|
4
|
Consensus sequence design as a general strategy to create hyperstable, biologically active proteins. Proc Natl Acad Sci U S A 2019; 116:11275-11284. [PMID: 31110018 DOI: 10.1073/pnas.1816707116] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Consensus sequence design offers a promising strategy for designing proteins of high stability while retaining biological activity since it draws upon an evolutionary history in which residues important for both stability and function are likely to be conserved. Although there have been several reports of successful consensus design of individual targets, it is unclear from these anecdotal studies how often this approach succeeds and how often it fails. Here, we attempt to assess generality by designing consensus sequences for a set of six protein families with a range of chain lengths, structures, and activities. We characterize the resulting consensus proteins for stability, structure, and biological activities in an unbiased way. We find that all six consensus proteins adopt cooperatively folded structures in solution. Strikingly, four of six of these consensus proteins show increased thermodynamic stability over naturally occurring homologs. Each consensus protein tested for function maintained at least partial biological activity. Although peptide binding affinity by a consensus-designed SH3 is rather low, K m values for consensus enzymes are similar to values from extant homologs. Although consensus enzymes are slower than extant homologs at low temperature, they are faster than some thermophilic enzymes at high temperature. An analysis of sequence properties shows consensus proteins to be enriched in charged residues, and rarified in uncharged polar residues. Sequence differences between consensus and extant homologs are predominantly located at weakly conserved surface residues, highlighting the importance of these residues in the success of the consensus strategy.
Collapse
|
5
|
Musil M, Konegger H, Hon J, Bednar D, Damborsky J. Computational Design of Stable and Soluble Biocatalysts. ACS Catal 2018. [DOI: 10.1021/acscatal.8b03613] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Milos Musil
- Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 612 66 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
| | - Hannes Konegger
- Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Hon
- Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, 612 66 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Centre for Toxic Compounds in the Environment (RECETOX), and Department of Experimental Biology, Faculty of Science, Masaryk University, 625 00 Brno, Czech Republic
- International Clinical Research Center, St. Anne’s University Hospital, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|