1
|
Koch NG, Budisa N. Evolution of Pyrrolysyl-tRNA Synthetase: From Methanogenesis to Genetic Code Expansion. Chem Rev 2024. [PMID: 38953775 DOI: 10.1021/acs.chemrev.4c00031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]
Abstract
Over 20 years ago, the pyrrolysine encoding translation system was discovered in specific archaea. Our Review provides an overview of how the once obscure pyrrolysyl-tRNA synthetase (PylRS) tRNA pair, originally responsible for accurately translating enzymes crucial in methanogenic metabolic pathways, laid the foundation for the burgeoning field of genetic code expansion. Our primary focus is the discussion of how to successfully engineer the PylRS to recognize new substrates and exhibit higher in vivo activity. We have compiled a comprehensive list of ncAAs incorporable with the PylRS system. Additionally, we also summarize recent successful applications of the PylRS system in creating innovative therapeutic solutions, such as new antibody-drug conjugates, advancements in vaccine modalities, and the potential production of new antimicrobials.
Collapse
Affiliation(s)
- Nikolaj G Koch
- Department of Chemistry, Institute of Physical Chemistry, University of Basel, 4058 Basel, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, 4058 Basel, Switzerland
| | - Nediljko Budisa
- Biocatalysis Group, Institute of Chemistry, Technische Universität Berlin, 10623 Berlin, Germany
- Chemical Synthetic Biology Chair, Department of Chemistry, University of Manitoba, Winnipeg MB R3T 2N2, Canada
| |
Collapse
|
2
|
Nixon C, Lim SA, Sternke M, Barrick D, Harms MJ, Marqusee S. The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins. Protein Sci 2024; 33:e5011. [PMID: 38747388 PMCID: PMC11094778 DOI: 10.1002/pro.5011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 04/02/2024] [Accepted: 04/23/2024] [Indexed: 05/19/2024]
Abstract
A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the consensus protein derived from our full Ribonuclease H sequence alignment is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted set of sequences is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order sequence correlations using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.
Collapse
Affiliation(s)
- Charlotte Nixon
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| | - Shion A. Lim
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
| | - Matt Sternke
- The T.C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Doug Barrick
- The T.C. Jenkins Department of BiophysicsJohns Hopkins UniversityBaltimoreMarylandUSA
| | - Michael J. Harms
- Department of Chemistry and BiochemistryUniversity of OregonEugeneOregonUSA
| | - Susan Marqusee
- Department of Molecular and Cell BiologyUniversity of California, BerkeleyBerkeleyCaliforniaUSA
- Department of ChemistryUniversity of California, BerkeleyBerkeleyCaliforniaUSA
- California Institute for Quantitative Biosciences (QB3)BerkeleyCaliforniaUSA
| |
Collapse
|
3
|
Johnson SR, Fu X, Viknander S, Goldin C, Monaco S, Zelezniak A, Yang KK. Computational scoring and experimental evaluation of enzymes generated by neural networks. Nat Biotechnol 2024:10.1038/s41587-024-02214-2. [PMID: 38653796 DOI: 10.1038/s41587-024-02214-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 03/20/2024] [Indexed: 04/25/2024]
Abstract
In recent years, generative protein sequence models have been developed to sample novel sequences. However, predicting whether generated proteins will fold and function remains challenging. We evaluate a set of 20 diverse computational metrics to assess the quality of enzyme sequences produced by three contrasting generative models: ancestral sequence reconstruction, a generative adversarial network and a protein language model. Focusing on two enzyme families, we expressed and purified over 500 natural and generated sequences with 70-90% identity to the most similar natural sequences to benchmark computational metrics for predicting in vitro enzyme activity. Over three rounds of experiments, we developed a computational filter that improved the rate of experimental success by 50-150%. The proposed metrics and models will drive protein engineering research by serving as a benchmark for generative protein sequence models and helping to select active variants for experimental testing.
Collapse
Affiliation(s)
| | - Xiaozhi Fu
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Sandra Viknander
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Clara Goldin
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | | | - Aleksej Zelezniak
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
- Institute of Biotechnology, Life Sciences Centre, Vilnius University, Vilnius, Lithuania.
- Randall Centre for Cell & Molecular Biophysics, King's College London, Guy's Campus, London, UK.
| | | |
Collapse
|
4
|
Tripp A, Braun M, Wieser F, Oberdorfer G, Lechner H. Click, Compute, Create: A Review of Web-based Tools for Enzyme Engineering. Chembiochem 2024:e202400092. [PMID: 38634409 DOI: 10.1002/cbic.202400092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/14/2024] [Accepted: 04/15/2024] [Indexed: 04/19/2024]
Abstract
Enzyme engineering, though pivotal across various biotechnological domains, is often plagued by its time-consuming and labor-intensive nature. This review aims to offer an overview of supportive in silico methodologies for this demanding endeavor. Starting from methods to predict protein structures, to classification of their activity and even the discovery of new enzymes we continue with describing tools used to increase thermostability and production yields of selected targets. Subsequently, we discuss computational methods to modulate both, the activity as well as selectivity of enzymes. Last, we present recent approaches based on cutting-edge machine learning methods to redesign enzymes. With exception of the last chapter, there is a strong focus on methods easily accessible via web-interfaces or simple Python-scripts, therefore readily useable for a diverse and broad community.
Collapse
Affiliation(s)
- Adrian Tripp
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Markus Braun
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Florian Wieser
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Gustav Oberdorfer
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
- BioTechMed, Graz, Austria
| | - Horst Lechner
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
- BioTechMed, Graz, Austria
| |
Collapse
|
5
|
Sennett MA, Theobald DL. Extant Sequence Reconstruction: The Accuracy of Ancestral Sequence Reconstructions Evaluated by Extant Sequence Cross-Validation. J Mol Evol 2024; 92:181-206. [PMID: 38502220 PMCID: PMC10978691 DOI: 10.1007/s00239-024-10162-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 02/20/2024] [Indexed: 03/21/2024]
Abstract
Ancestral sequence reconstruction (ASR) is a phylogenetic method widely used to analyze the properties of ancient biomolecules and to elucidate mechanisms of molecular evolution. Despite its increasingly widespread application, the accuracy of ASR is currently unknown, as it is generally impossible to compare resurrected proteins to the true ancestors. Which evolutionary models are best for ASR? How accurate are the resulting inferences? Here we answer these questions using a cross-validation method to reconstruct each extant sequence in an alignment with ASR methodology, a method we term "extant sequence reconstruction" (ESR). We thus can evaluate the accuracy of ASR methodology by comparing ESR reconstructions to the corresponding known true sequences. We find that a common measure of the quality of a reconstructed sequence, the average probability, is indeed a good estimate of the fraction of correct amino acids when the evolutionary model is accurate or overparameterized. However, the average probability is a poor measure for comparing reconstructions from different models, because, surprisingly, a more accurate phylogenetic model often results in reconstructions with lower probability. While better (more predictive) models may produce reconstructions with lower sequence identity to the true sequences, better models nevertheless produce reconstructions that are more biophysically similar to true ancestors. In addition, we find that a large fraction of sequences sampled from the reconstruction distribution may have fewer errors than the single most probable (SMP) sequence reconstruction, despite the fact that the SMP has the lowest expected error of all possible sequences. Our results emphasize the importance of model selection for ASR and the usefulness of sampling sequence reconstructions for analyzing ancestral protein properties. ESR is a powerful method for validating the evolutionary models used for ASR and can be applied in practice to any phylogenetic analysis of real biological sequences. Most significantly, ESR uses ASR methodology to provide a general method by which the biophysical properties of resurrected proteins can be compared to the properties of the true protein.
Collapse
Affiliation(s)
- Michael A Sennett
- Department of Biochemistry, Brandeis University, Waltham, MA, 02453, USA
| | - Douglas L Theobald
- Department of Biochemistry, Brandeis University, Waltham, MA, 02453, USA.
| |
Collapse
|
6
|
Li ZL, Sun CQ, Qing ZL, Li ZM, Liu HL. Engineering the thermal stability of a polyphosphate kinase by ancestral sequence reconstruction to expand the temperature boundary for an industrially applicable ATP regeneration system. Appl Environ Microbiol 2024; 90:e0157423. [PMID: 38236018 PMCID: PMC10880597 DOI: 10.1128/aem.01574-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2023] [Accepted: 12/06/2023] [Indexed: 01/19/2024] Open
Abstract
ATP-dependent energy-consuming enzymatic reactions are widely used in cell-free biocatalysis. However, the direct addition of large amounts of expensive ATP can greatly increase cost, and enzymatic production is often difficult to achieve as a result. Although a polyphosphate kinase (PPK)-polyphosphate-based ATP regeneration system has the potential to solve this challenge, the generally poor thermal stability of PPKs limits the widespread use of this method. In this paper, we evaluated the thermal stability of a PPK from Sulfurovum lithotrophicum (SlPPK2). After directed evolution and computation-supported design, we found that SlPPK2 is very recalcitrant and cannot acquire beneficial mutations. Inspired by the usually outstanding stability of ancestral enzymes, we reconstructed the ancestral sequence of the PPK family and used it as a guide to construct three heat-stable variants of SlPPK2, of which the L35F/T144S variant has a half-life of more than 14 h at 60°C. Molecular dynamics simulations were performed on all enzymes to analyze the reasons for the increased thermal stability. The results showed that mutations at these two positions act synergistically from the interior and surface of the protein, leading to a more compact structure. Finally, the robustness of the L35F/T144S variant was verified in the synthesis of nucleotides at high temperature. In practice, the use of this high-temperature ATP regeneration system can effectively avoid byproduct accumulation. Our work extends the temperature boundary of ATP regeneration and has great potential for industrial applications.IMPORTANCEATP regeneration is an important basic applied study in the field of cell-free biocatalysis. Polyphosphate kinase (PPK) is an enzyme tool widely used for energy regeneration during enzymatic reactions. However, the thermal stability of the PPKs reported to date that can efficiently regenerate ATP is usually poor, which greatly limits their application. In this study, the thermal stability of a difficult-to-engineer PPK from Sulfurovum lithotrophicum was improved, guided by an ancestral sequence reconstruction strategy. The optimal variant has a 4.5-fold longer half-life at 60°C than the wild-type enzyme, thus enabling the extension of the temperature boundary for ATP regeneration. The ability of this variant to regenerate ATP was well demonstrated during high-temperature enzymatic production of nucleotides.
Collapse
Affiliation(s)
- Zong-Lin Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Chuan-Qi Sun
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Zhou-Lei Qing
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
| | - Zhi-Min Li
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, China
- Shanghai Collaborative Innovation Center for Biomanufacturing Technology, Shanghai, China
| | - Hong-Lai Liu
- School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai, China
| |
Collapse
|
7
|
Fujikawa T, Sasamoto T, Zhao F, Yamagishi A, Akanuma S. Comparative analysis of reconstructed ancestral proteins with their extant counterparts suggests primitive life had an alkaline habitat. Sci Rep 2024; 14:398. [PMID: 38172176 PMCID: PMC10764835 DOI: 10.1038/s41598-023-50828-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 12/26/2023] [Indexed: 01/05/2024] Open
Abstract
To understand the origin and early evolution of life it is crucial to establish characteristics of the primordial environment that facilitated the emergence and evolution of life. One important environmental factor is the pH of the primordial environment. Here, we assessed the pH-dependent thermal stabilities of previously reconstructed ancestral nucleoside diphosphate kinases and ribosomal protein uS8s. The selected proteins were likely to be present in ancient organisms such as the last common ancestor of bacteria and that of archaea. We also assessed the thermal stability of homologous proteins from extant acidophilic, neutralophilic, and alkaliphilic microorganisms as a function of pH. Our results indicate that the reconstructed ancestral proteins are more akin to those of extant alkaliphilic bacteria, which display greater stability under alkaline conditions. These findings suggest that the common ancestors of bacterial and archaeal species thrived in an alkaline environment. Moreover, we demonstrate the reconstruction method employed in this study is a valuable technique for generating alkali-tolerant proteins that can be used in a variety of biotechnological and environmental applications.
Collapse
Affiliation(s)
- Takayuki Fujikawa
- Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Takahiro Sasamoto
- Department of Applied Life Science, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, 192-0392, Japan
| | - Fangzheng Zhao
- Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Science, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, 192-0392, Japan
| | - Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan.
| |
Collapse
|
8
|
Chisholm LO, Orlandi KN, Phillips SR, Shavlik MJ, Harms MJ. Ancestral Reconstruction and the Evolution of Protein Energy Landscapes. Annu Rev Biophys 2023; 53:10.1146/annurev-biophys-030722-125440. [PMID: 38134334 PMCID: PMC11192866 DOI: 10.1146/annurev-biophys-030722-125440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2023]
Abstract
A protein's sequence determines its conformational energy landscape. This, in turn, determines the protein's function. Understanding the evolution of new protein functions therefore requires understanding how mutations alter the protein energy landscape. Ancestral sequence reconstruction (ASR) has proven a valuable tool for tackling this problem. In ASR, one phylogenetically infers the sequences of ancient proteins, allowing characterization of their properties. When coupled to biophysical, biochemical, and functional characterization, ASR can reveal how historical mutations altered the energy landscape of ancient proteins, allowing the evolution of enzyme activity, altered conformations, binding specificity, oligomerization, and many other protein features. In this article, we review how ASR studies have been used to dissect the evolution of energy landscapes. We also discuss ASR studies that reveal how energy landscapes have shaped protein evolution. Finally, we propose that thinking about evolution from the perspective of an energy landscape can improve how we approach and interpret ASR studies. Expected final online publication date for the Annual Review of Biophysics, Volume 53 is May 2024. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Lauren O Chisholm
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Kona N Orlandi
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Sophia R Phillips
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Shavlik
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
- Department of Biology, University of Oregon, Eugene, Oregon, USA
| | - Michael J Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, Oregon, USA;
- Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA
| |
Collapse
|
9
|
Muellers SN, Allen KN, Whitty A. MEnTaT: A machine-learning approach for the identification of mutations to increase protein stability. Proc Natl Acad Sci U S A 2023; 120:e2309884120. [PMID: 38039271 PMCID: PMC10710055 DOI: 10.1073/pnas.2309884120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 10/16/2023] [Indexed: 12/03/2023] Open
Abstract
Enhancing protein thermal stability is important for biomedical and industrial applications as well as in the research laboratory. Here, we describe a simple machine-learning method which identifies amino acid substitutions that contribute to thermal stability based on comparison of the amino acid sequences of homologous proteins derived from bacteria that grow at different temperatures. A key feature of the method is that it compares the sequences based not simply on the amino acid identity, but rather on the structural and physicochemical properties of the side chain. The method accurately identified stabilizing substitutions in three well-studied systems and was validated prospectively by experimentally testing predicted stabilizing substitutions in a polyamine oxidase. In each case, the method outperformed the widely used bioinformatic consensus approach. The method can also provide insight into fundamental aspects of protein structure, for example, by identifying how many sequence positions in a given protein are relevant to temperature adaptation.
Collapse
Affiliation(s)
| | - Karen N. Allen
- Department of Chemistry, Boston University, Boston, MA02215
| | - Adrian Whitty
- Department of Chemistry, Boston University, Boston, MA02215
| |
Collapse
|
10
|
Ji Z, Belfield EJ, Zhang S, Bouvier J, Li S, Schnell J, Fu X, Harberd NP. Evolution of a plant growth-regulatory protein interaction specificity. NATURE PLANTS 2023; 9:2059-2070. [PMID: 37903985 PMCID: PMC10724065 DOI: 10.1038/s41477-023-01556-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 09/27/2023] [Indexed: 11/01/2023]
Abstract
Specific protein-protein interactions (PPIs) enable biological regulation. However, the evolution of PPI specificity is little understood. Here we trace the evolution of the land-plant growth-regulatory DELLA-SLY1/GID2 PPI, revealing progressive increase in specificity of affinity of SLY1/GID2 for a particular DELLA form. While early-diverging SLY1s display relatively broad-range DELLA affinity, later-diverging SLY1s tend towards increasingly stringent affinity for a specific DELLA A' form generated by the growth-promoting phytohormone gibberellin (GA). Our novel mutational strategy reveals amino acid substitutions contributing to the evolution of Arabidopsis thaliana SLY1 A' specificity, also showing that routes permitting reversion to broader affinity became increasingly constrained over evolutionary time. We suggest that progressive affinity narrowing may be an important evolutionary driver of PPI specificity and that increase in SLY1/GID2-DELLA specificity enabled the enhanced flexibility of plant physiological environmental adaptation conferred by the GA-DELLA growth-regulatory mechanism.
Collapse
Affiliation(s)
- Zhe Ji
- Department of Biology, University of Oxford, Oxford, UK
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, P. R. China
| | | | - Siyu Zhang
- National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing, PR China
| | | | - Shan Li
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, P. R. China
- National Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Nanjing Agricultural University, Nanjing, PR China
| | - Jason Schnell
- Department of Biochemistry, University of Oxford, Oxford, UK
| | - Xiangdong Fu
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, P. R. China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, P. R. China
- New Cornerstone Science Laboratory, Beijing, P. R. China
| | | |
Collapse
|
11
|
Romero-Romero S, Lindner S, Ferruz N. Exploring the Protein Sequence Space with Global Generative Models. Cold Spring Harb Perspect Biol 2023; 15:a041471. [PMID: 37848247 PMCID: PMC10626256 DOI: 10.1101/cshperspect.a041471] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2023]
Abstract
Recent advancements in specialized large-scale architectures for training images and language have profoundly impacted the field of computer vision and natural language processing (NLP). Language models, such as the recent ChatGPT and GPT-4, have demonstrated exceptional capabilities in processing, translating, and generating human language. These breakthroughs have also been reflected in protein research, leading to the rapid development of numerous new methods in a short time, with unprecedented performance. Several of these models have been developed with the goal of generating sequences in novel regions of the protein space. In this work, we provide an overview of the use of protein generative models, reviewing (1) language models for the design of novel artificial proteins, (2) works that use non-transformer architectures, and (3) applications in directed evolution approaches.
Collapse
Affiliation(s)
| | | | - Noelia Ferruz
- Barcelona Institute of Molecular Biology, 08028 Barcelona, Spain
| |
Collapse
|
12
|
Nicoll CR, Massari M, Fraaije MW, Mascotti ML, Mattevi A. Impact of ancestral sequence reconstruction on mechanistic and structural enzymology. Curr Opin Struct Biol 2023; 82:102669. [PMID: 37544113 DOI: 10.1016/j.sbi.2023.102669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 06/19/2023] [Accepted: 07/10/2023] [Indexed: 08/08/2023]
Abstract
Ancestral sequence reconstruction (ASR) provides insight into the changes within a protein sequence across evolution. More specifically, it can illustrate how specific amino acid changes give rise to different phenotypes within a protein family. Over the last few decades it has established itself as a powerful technique for revealing molecular common denominators that govern enzyme function. Here, we describe the strength of ASR in unveiling catalytic mechanisms and emerging phenotypes for a range of different proteins, also highlighting biotechnological applications the methodology can provide.
Collapse
Affiliation(s)
- Callum R Nicoll
- Department of Biology and Biotechnology Lazzaro Spallanzani, University of Pavia, Via Ferrata 9, 27100, Pavia, Italy
| | - Marta Massari
- Department of Biology and Biotechnology Lazzaro Spallanzani, University of Pavia, Via Ferrata 9, 27100, Pavia, Italy
| | - Marco W Fraaije
- Molecular Enzymology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747, AG Groningen, the Netherlands. https://twitter.com/fraaije1
| | - Maria Laura Mascotti
- Molecular Enzymology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747, AG Groningen, the Netherlands; IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, Ejército de los Andes 950, D5700HHW, San Luis, Argentina
| | - Andrea Mattevi
- Department of Biology and Biotechnology Lazzaro Spallanzani, University of Pavia, Via Ferrata 9, 27100, Pavia, Italy.
| |
Collapse
|
13
|
Wijayanti SD, Tsvik L, Haltrich D. Recent Advances in Electrochemical Enzyme-Based Biosensors for Food and Beverage Analysis. Foods 2023; 12:3355. [PMID: 37761066 PMCID: PMC10529900 DOI: 10.3390/foods12183355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 08/28/2023] [Accepted: 09/04/2023] [Indexed: 09/29/2023] Open
Abstract
Food analysis and control are crucial aspects in food research and production in order to ensure quality and safety of food products. Electrochemical biosensors based on enzymes as the bioreceptors are emerging as promising tools for food analysis because of their high selectivity and sensitivity, short analysis time, and high-cost effectiveness in comparison to conventional methods. This review provides the readers with an overview of various electrochemical enzyme-based biosensors in food analysis, focusing on enzymes used for different applications in the analysis of sugars, alcohols, amino acids and amines, and organic acids, as well as mycotoxins and chemical contaminants. In addition, strategies to improve the performance of enzyme-based biosensors that have been reported over the last five years will be discussed. The challenges and future outlooks for the food sector are also presented.
Collapse
Affiliation(s)
- Sudarma Dita Wijayanti
- Laboratory of Food Biotechnology, Department of Food Science and Technology, University of Natural Resources and Life Sciences Vienna, Muthgasse 11, A-1190 Wien, Austria; (S.D.W.)
- Department of Food Science and Biotechnology, Brawijaya University, Malang 65145, Indonesia
| | - Lidiia Tsvik
- Laboratory of Food Biotechnology, Department of Food Science and Technology, University of Natural Resources and Life Sciences Vienna, Muthgasse 11, A-1190 Wien, Austria; (S.D.W.)
| | - Dietmar Haltrich
- Laboratory of Food Biotechnology, Department of Food Science and Technology, University of Natural Resources and Life Sciences Vienna, Muthgasse 11, A-1190 Wien, Austria; (S.D.W.)
| |
Collapse
|
14
|
Buda K, Miton CM, Fan XC, Tokuriki N. Molecular determinants of protein evolvability. Trends Biochem Sci 2023; 48:751-760. [PMID: 37330341 DOI: 10.1016/j.tibs.2023.05.009] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 05/18/2023] [Accepted: 05/23/2023] [Indexed: 06/19/2023]
Abstract
The plethora of biological functions that sustain life is rooted in the remarkable evolvability of proteins. An emerging view highlights the importance of a protein's initial state in dictating evolutionary success. A deeper comprehension of the mechanisms that govern the evolvability of these initial states can provide invaluable insights into protein evolution. In this review, we describe several molecular determinants of protein evolvability, unveiled by experimental evolution and ancestral sequence reconstruction studies. We further discuss how genetic variation and epistasis can promote or constrain functional innovation and suggest putative underlying mechanisms. By establishing a clear framework for these determinants, we provide potential indicators enabling the forecast of suitable evolutionary starting points and delineate molecular mechanisms in need of deeper exploration.
Collapse
Affiliation(s)
- Karol Buda
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Charlotte M Miton
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Xingyu Cara Fan
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada.
| |
Collapse
|
15
|
Dishman AF, Volkman BF. Metamorphic protein folding as evolutionary adaptation. Trends Biochem Sci 2023; 48:665-672. [PMID: 37270322 PMCID: PMC10526677 DOI: 10.1016/j.tibs.2023.05.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 04/12/2023] [Accepted: 05/04/2023] [Indexed: 06/05/2023]
Abstract
Metamorphic proteins switch reversibly between multiple distinct, stable structures, often with different functions. It was previously hypothesized that metamorphic proteins arose as intermediates in the evolution of a new fold - rare and transient exceptions to the 'one sequence, one fold' paradigm. However, as described herein, mounting evidence suggests that metamorphic folding is an adaptive feature, preserved and optimized over evolutionary time as exemplified by the NusG family and the chemokine XCL1. Analysis of extant protein families and resurrected protein ancestors demonstrates that large regions of sequence space are compatible with metamorphic folding. As a category that enhances biological fitness, metamorphic proteins are likely to employ fold switching to perform important biological functions and may be more common than previously thought.
Collapse
Affiliation(s)
- Acacia F Dishman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI 53226, USA; Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| | - Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI 53226, USA; Program in Chemical Biology, Medical College of Wisconsin, Milwaukee, WI 53226, USA.
| |
Collapse
|
16
|
Li S, Li Z, Tan GY, Xin Z, Wang W. In vitro allosteric transcription factor-based biosensing. Trends Biotechnol 2023; 41:1080-1095. [PMID: 36967257 DOI: 10.1016/j.tibtech.2023.03.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 02/15/2023] [Accepted: 03/01/2023] [Indexed: 06/18/2023]
Abstract
A biosensor is an analytical device that converts a biological response into a measurable output signal. Bacterial allosteric transcription factors (aTFs) have been utilized as a novel class of recognition elements for in vitro biosensing, which circumvents the limitations of aTF-based whole-cell biosensors (WCBs) and helps to meet the increasing requirement of small-molecule biosensors for diverse applications. In this review, we summarize the recent advances related to the configuration of aTF-based biosensors in vitro. Particularly, we evaluate the advantages of aTFs for in vitro biosensing and highlight their great potential for the establishment of robust and easy-to-implement biosensing strategies. We argue that key technical innovations and generalizable workflows will enhance the pipeline for facile construction of diverse aTF-based small-molecule biosensors.
Collapse
Affiliation(s)
- Shanshan Li
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing 100193, PR China
| | - Zilong Li
- State Key Laboratory of Microbial Resources, Institute of Microbiology, CAS, Beijing 100101, PR China
| | - Gao-Yi Tan
- State Key Laboratory of Bioreactor Engineering and School of Biotechnology, East China University of Science and Technology, Shanghai 200237, PR China
| | - Zhenguo Xin
- State Key Laboratory of Microbial Resources, Institute of Microbiology, CAS, Beijing 100101, PR China; University of Chinese Academy of Sciences, Beijing 100049, PR China
| | - Weishan Wang
- State Key Laboratory of Microbial Resources, Institute of Microbiology, CAS, Beijing 100101, PR China; University of Chinese Academy of Sciences, Beijing 100049, PR China.
| |
Collapse
|
17
|
Nixon C, Lim SA, Sternke M, Barrick D, Harms M, Marqusee S. The importance of input sequence set to consensus-derived proteins and their relationship to reconstructed ancestral proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.29.547063. [PMID: 37425932 PMCID: PMC10327145 DOI: 10.1101/2023.06.29.547063] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
A protein sequence encodes its energy landscape - all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs - questioning the differences and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the overall consensus protein is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted region is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order couplings using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.
Collapse
Affiliation(s)
- Charlotte Nixon
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
| | - Shion A Lim
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
| | - Matt Sternke
- The T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218
| | - Doug Barrick
- The T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218
| | - Mike Harms
- Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR 97403
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA 94720
- Department of Chemistry, University of California, Berkeley, Berkeley, CA 94720
- California Institute for Quantitative Biosciences (QB3), Berkeley
| |
Collapse
|
18
|
Lei L, Zhao L, Hou Y, Yue C, Liu P, Zheng Y, Peng W, Yang J. An Inferred Ancestral CotA Laccase with Improved Expression and Kinetic Efficiency. Int J Mol Sci 2023; 24:10901. [PMID: 37446078 DOI: 10.3390/ijms241310901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 06/17/2023] [Accepted: 06/26/2023] [Indexed: 07/15/2023] Open
Abstract
Laccases are widely used in industrial production due to their broad substrate availability and environmentally friendly nature. However, the pursuit of laccases with superior stability and increased heterogeneous expression to meet industry demands appears to be an ongoing challenge. To address this challenge, we resurrected five ancestral sequences of laccase BsCotA and their homologues. All five variants were successfully expressed in soluble and functional forms with improved expression levels in Escherichia coli. Among the five variants, three exhibited higher catalytic rates, thermal stabilities, and acidic stabilities. Notably, AncCotA2, the best-performing variant, displayed a kcat/KM of 7.5 × 105 M-1·s-1, 5.2-fold higher than that of the wild-type BsCotA, an improved thermo- and acidic stability, and better dye decolorization ability. This study provides a laccase variant with high application potential and presents a new starting point for future enzyme engineering.
Collapse
Affiliation(s)
- Lei Lei
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| | - Lijun Zhao
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| | - Yiqia Hou
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| | - Chen Yue
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| | - Pulin Liu
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| | - Yanli Zheng
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| | - Wenfang Peng
- State Key Laboratory of Biocatalysis and Enzyme Engineering, College of Life Science, Hubei University, Wuhan 430062, China
| | - Jiangke Yang
- School of Life Science and Technology, Wuhan Polytechnic University, Wuhan 430023, China
| |
Collapse
|
19
|
Chiang CH, Wymore T, Rodríguez Benítez A, Hussain A, Smith JL, Brooks CL, Narayan ARH. Deciphering the evolution of flavin-dependent monooxygenase stereoselectivity using ancestral sequence reconstruction. Proc Natl Acad Sci U S A 2023; 120:e2218248120. [PMID: 37014851 PMCID: PMC10104550 DOI: 10.1073/pnas.2218248120] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 03/06/2023] [Indexed: 04/05/2023] Open
Abstract
Controlling the selectivity of a reaction is critical for target-oriented synthesis. Accessing complementary selectivity profiles enables divergent synthetic strategies, but is challenging to achieve in biocatalytic reactions given enzymes' innate preferences of a single selectivity. Thus, it is critical to understand the structural features that control selectivity in biocatalytic reactions to achieve tunable selectivity. Here, we investigate the structural features that control the stereoselectivity in an oxidative dearomatization reaction that is key to making azaphilone natural products. Crystal structures of enantiocomplementary biocatalysts guided the development of multiple hypotheses centered on the structural features that control the stereochemical outcome of the reaction; however, in many cases, direct substitutions of active site residues in natural proteins led to inactive enzymes. Ancestral sequence reconstruction (ASR) and resurrection were employed as an alternative strategy to probe the impact of each residue on the stereochemical outcome of the dearomatization reaction. These studies suggest that two mechanisms are active in controlling the stereochemical outcome of the oxidative dearomatization reaction: one involving multiple active site residues in AzaH and the other dominated by a single Phe to Tyr switch in TropB and AfoD. Moreover, this study suggests that the flavin-dependent monooxygenases (FDMOs) adopt simple and flexible strategies to control stereoselectivity, which has led to stereocomplementary azaphilone natural products produced by fungi. This paradigm of combining ASR and resurrection with mutational and computational studies showcases sets of tools for understanding enzyme mechanisms and provides a solid foundation for future protein engineering efforts.
Collapse
Affiliation(s)
- Chang-Hwa Chiang
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
- Life Sciences Institute, University of Michigan, Ann Arbor, MI48109
| | - Troy Wymore
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
- Department of Chemistry, Stony Brook University, Stony Brook, NY11794
| | - Attabey Rodríguez Benítez
- Life Sciences Institute, University of Michigan, Ann Arbor, MI48109
- Program in Chemical Biology, University of Michigan, Ann Arbor, MI48109
| | - Azam Hussain
- Macromolecular Science and Engineering Program, University of Michigan, Ann Arbor, MI48109
| | - Janet L. Smith
- Life Sciences Institute, University of Michigan, Ann Arbor, MI48109
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI48109
| | - Charles L. Brooks
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
- Program in Chemical Biology, University of Michigan, Ann Arbor, MI48109
- Department of Biophysics, University of Michigan, Ann Arbor, MI48109
| | - Alison R. H. Narayan
- Department of Chemistry, University of Michigan, Ann Arbor, MI48109
- Life Sciences Institute, University of Michigan, Ann Arbor, MI48109
- Program in Chemical Biology, University of Michigan, Ann Arbor, MI48109
| |
Collapse
|
20
|
Livada J, Vargas AM, Martinez CA, Lewis RD. Ancestral Sequence Reconstruction Enhances Gene Mining Efforts for Industrial Ene Reductases by Expanding Enzyme Panels with Thermostable Catalysts. ACS Catal 2023. [DOI: 10.1021/acscatal.2c03859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Affiliation(s)
- Jovan Livada
- Pfizer Global Research and Development, Chemical Research Development, MS 4073 Eastern Point Road, Groton, Connecticut 06340, United States
| | - Ariana M. Vargas
- Pfizer Global Research and Development, Chemical Research Development, MS 4073 Eastern Point Road, Groton, Connecticut 06340, United States
| | - Carlos A. Martinez
- Pfizer Global Research and Development, Chemical Research Development, MS 4073 Eastern Point Road, Groton, Connecticut 06340, United States
| | - Russell D. Lewis
- Pfizer Global Research and Development, Chemical Research Development, MS 4073 Eastern Point Road, Groton, Connecticut 06340, United States
| |
Collapse
|
21
|
Mingarro G, Del Olmo ML. Improvements in the genetic editing technologies: CRISPR-Cas and beyond. Gene 2023; 852:147064. [PMID: 36435506 DOI: 10.1016/j.gene.2022.147064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Revised: 10/31/2022] [Accepted: 11/17/2022] [Indexed: 11/25/2022]
Abstract
Gene editing is a great hope not only for the scientific community, but also for society in general. This is due to its potential therapeutic applications that would allow curing diseases of genetic origin. The first realistic approach to achieve this goal was the development of CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) tools. This review deals with some of the improvements that have been designed to obtain more efficient and safer genome editing. Initial CRISPR-Cas (CRISPR associated) editing systems yield low efficiency and undesired editing products. To solve these problems, new approaches emerged, such as the creation of base editors. Recent discoveries have led to the development of many interesting alternatives, such as the CRISPR-associated transposable systems, which open the range by generating guided insertions, or the discovery of other programmable nucleases like the IscB family, which greatly increase the range of proteins available for editing uses. Also, to address the limitations of base editors, prime editors were created; this novel system, despite having some disadvantages compared to base editor systems, has the potential to generate all the possible point mutations. On the other hand, dual prime editing systems (like twin and homologous 3' extension-mediated prime editors) have been developed to create targeted insertions and enhance the editing outcomes, respectively. Furthermore, advances in gene editing do not reside solely in CRISPR-dependent systems, as we will discuss when treating the Replication Interrupted Template-Driven DNA Modification technique.
Collapse
Affiliation(s)
- Gerard Mingarro
- Departament de Bioquímica i Biologia Molecular, Facultat de Ciències Biològiques, Universitat de València. Burjassot (València), Spain
| | - Marcel Lí Del Olmo
- Departament de Bioquímica i Biologia Molecular, Facultat de Ciències Biològiques, Universitat de València. Burjassot (València), Spain.
| |
Collapse
|
22
|
Zhao F, Akanuma S. Ancestral Sequence Reconstruction of the Ribosomal Protein uS8 and Reduction of Amino Acid Usage to a Smaller Alphabet. J Mol Evol 2023; 91:10-23. [PMID: 36396786 DOI: 10.1007/s00239-022-10078-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 11/08/2022] [Indexed: 11/19/2022]
Abstract
Understanding the origin and early evolution of proteins is important for unveiling how the RNA world developed into an RNA-protein world. Because the composition of organic molecules in the Earth's primitive environment was plausibly not as diverse as today, the number of different amino acids used in early protein synthesis is likely to be substantially less than the current 20 proteinogenic residues. In this study, we have explored the thermal stability and RNA binding of ancestral variants of the ribosomal protein uS8 constructed from a reduced-alphabet of amino acids. First, we built a phylogenetic tree based on the amino acid sequences of uS8 from multiple extant organisms and used the tree to infer two plausible amino acid sequences corresponding to the last bacterial common ancestor of uS8. Both ancestral proteins were thermally stable and bound to an RNA fragment. By eliminating individual amino acid letters and monitoring thermal stability and RNA binding in the resulting proteins, we reduced the size of the amino acid set constituting one of the ancestral proteins, eventually finding that convergent sequences consisting of 15- or 14-amino acid alphabets still folded into stable structures that bound to the RNA fragment. Furthermore, a simplified variant reconstructed from a 13-amino-acid alphabet retained affinity for the RNA fragment, although it lost conformational stability. Collectively, RNA-binding activity may be achieved with a subset of the current 20 amino acids, raising the possibility of a simpler composition of RNA-binding proteins in the earliest stage of protein evolution.
Collapse
Affiliation(s)
- Fangzheng Zhao
- Faculty of Human Sciences, Waseda University, 2-579-15, Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, 2-579-15, Mikajima, Tokorozawa, Saitama, 359-1192, Japan.
| |
Collapse
|
23
|
Clifton BE, Kozome D, Laurino P. Efficient Exploration of Sequence Space by Sequence-Guided Protein Engineering and Design. Biochemistry 2023; 62:210-220. [PMID: 35245020 DOI: 10.1021/acs.biochem.1c00757] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
The rapid growth of sequence databases over the past two decades means that protein engineers faced with optimizing a protein for any given task will often have immediate access to a vast number of related protein sequences. These sequences encode information about the evolutionary history of the protein and the underlying sequence requirements to produce folded, stable, and functional protein variants. Methods that can take advantage of this information are an increasingly important part of the protein engineering tool kit. In this Perspective, we discuss the utility of sequence data in protein engineering and design, focusing on recent advances in three main areas: the use of ancestral sequence reconstruction as an engineering tool to generate thermostable and multifunctional proteins, the use of sequence data to guide engineering of multipoint mutants by structure-based computational protein design, and the use of unlabeled sequence data for unsupervised and semisupervised machine learning, allowing the generation of diverse and functional protein sequences in unexplored regions of sequence space. Altogether, these methods enable the rapid exploration of sequence space within regions enriched with functional proteins and therefore have great potential for accelerating the engineering of stable, functional, and diverse proteins for industrial and biomedical applications.
Collapse
Affiliation(s)
- Ben E Clifton
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Dan Kozome
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| | - Paola Laurino
- Protein Engineering and Evolution Unit, Okinawa Institute of Science and Technology, 1919-1 Tancha, Onna, Okinawa 904-0495, Japan
| |
Collapse
|
24
|
Verma A, Åberg-Zingmark E, Sparrman T, Mushtaq AU, Rogne P, Grundström C, Berntsson R, Sauer UH, Backman L, Nam K, Sauer-Eriksson E, Wolf-Watz M. Insights into the evolution of enzymatic specificity and catalysis: From Asgard archaea to human adenylate kinases. SCIENCE ADVANCES 2022; 8:eabm4089. [PMID: 36332013 PMCID: PMC9635829 DOI: 10.1126/sciadv.abm4089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 09/15/2022] [Indexed: 06/16/2023]
Abstract
Enzymatic catalysis is critically dependent on selectivity, active site architecture, and dynamics. To contribute insights into the interplay of these properties, we established an approach with NMR, crystallography, and MD simulations focused on the ubiquitous phosphotransferase adenylate kinase (AK) isolated from Odinarchaeota (OdinAK). Odinarchaeota belongs to the Asgard archaeal phylum that is believed to be the closest known ancestor to eukaryotes. We show that OdinAK is a hyperthermophilic trimer that, contrary to other AK family members, can use all NTPs for its phosphorylation reaction. Crystallographic structures of OdinAK-NTP complexes revealed a universal NTP-binding motif, while 19F NMR experiments uncovered a conserved and rate-limiting dynamic signature. As a consequence of trimerization, the active site of OdinAK was found to be lacking a critical catalytic residue and is therefore considered to be "atypical." On the basis of discovered relationships with human monomeric homologs, our findings are discussed in terms of evolution of enzymatic substrate specificity and cold adaptation.
Collapse
Affiliation(s)
- Apoorv Verma
- Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
| | | | - Tobias Sparrman
- Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
| | | | - Per Rogne
- Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
| | | | - Ronnie Berntsson
- Department of Medical Biochemistry and Biophysics, Umeå University, 901 87 Umeå, Sweden
- Wallenberg Centre for Molecular Medicine, Umeå University, 901 87 Umeå, Sweden
| | - Uwe H. Sauer
- Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
| | - Lars Backman
- Department of Chemistry, Umeå University, 901 87 Umeå, Sweden
| | - Kwangho Nam
- Department of Chemistry and Biochemistry, University of Texas at Arlington, Arlington, TX 76019, USA
| | | | | |
Collapse
|
25
|
Ayuso-Fernández I, Molpeceres G, Camarero S, Ruiz-Dueñas FJ, Martínez AT. Ancestral sequence reconstruction as a tool to study the evolution of wood decaying fungi. FRONTIERS IN FUNGAL BIOLOGY 2022; 3:1003489. [PMID: 37746217 PMCID: PMC10512382 DOI: 10.3389/ffunb.2022.1003489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 09/22/2022] [Indexed: 09/26/2023]
Abstract
The study of evolution is limited by the techniques available to do so. Aside from the use of the fossil record, molecular phylogenetics can provide a detailed characterization of evolutionary histories using genes, genomes and proteins. However, these tools provide scarce biochemical information of the organisms and systems of interest and are therefore very limited when they come to explain protein evolution. In the past decade, this limitation has been overcome by the development of ancestral sequence reconstruction (ASR) methods. ASR allows the subsequent resurrection in the laboratory of inferred proteins from now extinct organisms, becoming an outstanding tool to study enzyme evolution. Here we review the recent advances in ASR methods and their application to study fungal evolution, with special focus on wood-decay fungi as essential organisms in the global carbon cycling.
Collapse
Affiliation(s)
- Iván Ayuso-Fernández
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Gonzalo Molpeceres
- Centro de Investigaciones Biológicas “Margarita Salas” (CIB), CSIC, Madrid, Spain
| | - Susana Camarero
- Centro de Investigaciones Biológicas “Margarita Salas” (CIB), CSIC, Madrid, Spain
| | | | - Angel T. Martínez
- Centro de Investigaciones Biológicas “Margarita Salas” (CIB), CSIC, Madrid, Spain
| |
Collapse
|
26
|
Hobbs HT, Shah NH, Shoemaker SR, Amacher JF, Marqusee S, Kuriyan J. Saturation mutagenesis of a predicted ancestral Syk-family kinase. Protein Sci 2022; 31:e4411. [PMID: 36173161 PMCID: PMC9601881 DOI: 10.1002/pro.4411] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Revised: 06/27/2022] [Accepted: 07/25/2022] [Indexed: 11/08/2022]
Abstract
Many tyrosine kinases cannot be expressed readily in Escherichia coli, limiting facile production of these proteins for biochemical experiments. We used ancestral sequence reconstruction to generate a spleen tyrosine kinase (Syk) variant that can be expressed in bacteria and purified in soluble form, unlike the human members of this family (Syk and zeta-chain-associated protein kinase of 70 kDa [ZAP-70]). The catalytic activity, substrate specificity, and regulation by phosphorylation of this Syk variant are similar to the corresponding properties of human Syk and ZAP-70. Taking advantage of the ability to express this novel Syk-family kinase in bacteria, we developed a two-hybrid assay that couples the growth of E. coli in the presence of an antibiotic to successful phosphorylation of a bait peptide by the kinase. Using this assay, we screened a site-saturation mutagenesis library of the kinase domain of this reconstructed Syk-family kinase. Sites of loss-of-function mutations identified in the screen correlate well with residues established previously as critical to function and/or structure in protein kinases. We also identified activating mutations in the regulatory hydrophobic spine and activation loop, which are within key motifs involved in kinase regulation. Strikingly, one mutation in an ancestral Syk-family variant increases the soluble expression of the protein by 75-fold. Thus, through ancestral sequence reconstruction followed by deep mutational scanning, we have generated Syk-family kinase variants that can be expressed in bacteria with very high yield.
Collapse
Affiliation(s)
- Helen T. Hobbs
- Department of ChemistryUniversity of CaliforniaBerkeleyCaliforniaUSA
- Department of Biomedical EngineeringUniversity of CaliforniaIrvineCaliforniaUSA
| | - Neel H. Shah
- Department of ChemistryColumbia UniversityNew YorkNew YorkUSA
| | - Sophie R. Shoemaker
- Department of Molecular and Cell BiologyUniversity of CaliforniaBerkeleyCaliforniaUSA
| | - Jeanine F. Amacher
- Department of ChemistryWestern Washington UniversityBellinghamWashingtonUSA
| | - Susan Marqusee
- Department of ChemistryUniversity of CaliforniaBerkeleyCaliforniaUSA
- Department of Molecular and Cell BiologyUniversity of CaliforniaBerkeleyCaliforniaUSA
- California Institute for Quantitative BiosciencesUniversity of CaliforniaBerkeleyCaliforniaUSA
| | - John Kuriyan
- Department of ChemistryUniversity of CaliforniaBerkeleyCaliforniaUSA
- Department of Molecular and Cell BiologyUniversity of CaliforniaBerkeleyCaliforniaUSA
- California Institute for Quantitative BiosciencesUniversity of CaliforniaBerkeleyCaliforniaUSA
- Howard Hughes Medical InstituteUniversity of CaliforniaBerkeleyCaliforniaUSA
| |
Collapse
|
27
|
Tang QY, Ren W, Wang J, Kaneko K. The Statistical Trends of Protein Evolution: A Lesson from AlphaFold Database. Mol Biol Evol 2022; 39:6701686. [PMID: 36108094 PMCID: PMC9550990 DOI: 10.1093/molbev/msac197] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The recent development of artificial intelligence provides us with new and powerful tools for studying the mysterious relationship between organism evolution and protein evolution. In this work, based on the AlphaFold Protein Structure Database (AlphaFold DB), we perform comparative analyses of the proteins of different organisms. The statistics of AlphaFold-predicted structures show that, for organisms with higher complexity, their constituent proteins will have larger radii of gyration, higher coil fractions, and slower vibrations, statistically. By conducting normal mode analysis and scaling analyses, we demonstrate that higher organismal complexity correlates with lower fractal dimensions in both the structure and dynamics of the constituent proteins, suggesting that higher functional specialization is associated with higher organismal complexity. We also uncover the topology and sequence bases of these correlations. As the organismal complexity increases, the residue contact networks of the constituent proteins will be more assortative, and these proteins will have a higher degree of hydrophilic-hydrophobic segregation in the sequences. Furthermore, by comparing the statistical structural proximity across the proteomes with the phylogenetic tree of homologous proteins, we show that, statistical structural proximity across the proteomes may indirectly reflect the phylogenetic proximity, indicating a statistical trend of protein evolution in parallel with organism evolution. This study provides new insights into how the diversity in the functionality of proteins increases and how the dimensionality of the manifold of protein dynamics reduces during evolution, contributing to the understanding of the origin and evolution of lives.
Collapse
Affiliation(s)
| | - Weitong Ren
- Theoretical Molecular Science Laboratory, RIKEN Cluster for Pioneering Research, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
| | - Jun Wang
- School of Physics, National Laboratory of Solid State Microstructure, and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, People’s Republic of China
| | | |
Collapse
|
28
|
Harris KL, Thomson RES, Gumulya Y, Foley G, Carrera-Pacheco SE, Syed P, Janosik T, Sandinge AS, Andersson S, Jurva U, Bodén M, Gillam EMJ. Ancestral sequence reconstruction of a cytochrome P450 family involved in chemical defence reveals the functional evolution of a promiscuous, xenobiotic-metabolizing enzyme in vertebrates. Mol Biol Evol 2022; 39:6593376. [PMID: 35639613 PMCID: PMC9185370 DOI: 10.1093/molbev/msac116] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
The cytochrome P450 family 1 enzymes (CYP1s) are a diverse family of hemoprotein monooxygenases, which metabolize many xenobiotics including numerous environmental carcinogens. However, their historical function and evolution remain largely unstudied. Here we investigate CYP1 evolution via the reconstruction and characterization of the vertebrate CYP1 ancestors. Younger ancestors and extant forms generally demonstrated higher activity toward typical CYP1 xenobiotic and steroid substrates than older ancestors, suggesting significant diversification away from the original CYP1 function. Caffeine metabolism appears to be a recently evolved trait of the CYP1A subfamily, observed in the mammalian CYP1A lineage, and may parallel the recent evolution of caffeine synthesis in multiple separate plant species. Likewise, the aryl hydrocarbon receptor agonist, 6-formylindolo[3,2-b]carbazole (FICZ) was metabolized to a greater extent by certain younger ancestors and extant forms, suggesting that activity toward FICZ increased in specific CYP1 evolutionary branches, a process that may have occurred in parallel to the exploitation of land where UV-exposure was higher than in aquatic environments. As observed with previous reconstructions of P450 enzymes, thermostability correlated with evolutionary age; the oldest ancestor was up to 35 °C more thermostable than the extant forms, with a 10T50 (temperature at which 50% of the hemoprotein remains intact after 10 min) of 71 °C. This robustness may have facilitated evolutionary diversification of the CYP1s by buffering the destabilizing effects of mutations that conferred novel functions, a phenomenon which may also be useful in exploiting the catalytic versatility of these ancestral enzymes for commercial application as biocatalysts.
Collapse
Affiliation(s)
- Kurt L Harris
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| | - Raine E S Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| | - Yosephine Gumulya
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| | - Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| | - Saskya E Carrera-Pacheco
- Centro de Investigación Biomédica, Facultad de Ciencias de la Salud Eugenio Espejo, Universidad UTE, Quito 170147, Ecuador
| | - Parnayan Syed
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| | - Tomasz Janosik
- RISE Research Institutes of Sweden, Division Bioeconomy and Health, Chemical Process and Pharmaceutical Development, Södertälje, Sweden
| | - Ann-Sofie Sandinge
- DMPK, Early Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Astrazeneca, Gothenburg, Sweden
| | - Shalini Andersson
- Discovery Sciences, BioPharmaceuticals R&D, Astrazeneca, Gothenburg, Sweden
| | - Ulrik Jurva
- DMPK, Early Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, Astrazeneca, Gothenburg, Sweden
| | - Mikael Bodén
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| | - Elizabeth M J Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072 Australia
| |
Collapse
|
29
|
Karlsson E, Sorgenfrei FA, Andersson E, Dogan J, Jemth P, Chi CN. The dynamic properties of a nuclear coactivator binding domain are evolutionarily conserved. Commun Biol 2022; 5:286. [PMID: 35354917 PMCID: PMC8967867 DOI: 10.1038/s42003-022-03217-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 03/02/2022] [Indexed: 12/21/2022] Open
Abstract
Evolution of proteins is constrained by their structure and function. While there is a consensus that the plasticity of intrinsically disordered proteins relaxes the structural constraints on evolution there is a paucity of data on the molecular details of these processes. The Nuclear Coactivator Binding Domain (NCBD) from CREB-binding protein is a protein interaction domain, which contains a hydrophobic core but is not behaving as a typical globular domain, and has been described as 'molten-globule like'. The highly dynamic properties of NCBD makes it an interesting model system for evolutionary structure-function investigation of intrinsically disordered proteins. We have here compared the structure and biophysical properties of an ancient version of NCBD present in a bilaterian animal ancestor living around 600 million years ago with extant human NCBD. Using a combination of NMR spectroscopy, circular dichroism and kinetics we show that although NCBD has increased its thermodynamic stability, it has retained its dynamic biophysical properties in the ligand-free state in the evolutionary lineage leading from the last common bilaterian ancestor to humans. Our findings suggest that the dynamic properties of NCBD have been maintained by purifying selection and thus are important for its function, which includes mediating several distinct protein-protein interactions.
Collapse
Affiliation(s)
- Elin Karlsson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Frieda A Sorgenfrei
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden.,acib GmbH, Krenngasse 37, 8010 Graz c/o University of Graz, Institute of Chemistry, NAWI Graz, BioTechMed Graz, Heinrichstrasse 28, 8010, Graz, Austria
| | - Eva Andersson
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Jakob Dogan
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden.
| | - Celestine N Chi
- Department of Medical Biochemistry and Microbiology, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden. .,Department of Pharmaceutical Biosciences, Uppsala University, BMC Box 582, SE-75123, Uppsala, Sweden.
| |
Collapse
|
30
|
Sinnott‐Armstrong MA, Deanna R, Pretz C, Liu S, Harris JC, Dunbar‐Wallis A, Smith SD, Wheeler LC. How to approach the study of syndromes in macroevolution and ecology. Ecol Evol 2022; 12:e8583. [PMID: 35342598 PMCID: PMC8928880 DOI: 10.1002/ece3.8583] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 12/23/2021] [Accepted: 12/31/2021] [Indexed: 11/12/2022] Open
Abstract
Syndromes, wherein multiple traits evolve convergently in response to a shared selective driver, form a central concept in ecology and evolution. Recent work has questioned the existence of some classic syndromes, such as pollination and seed dispersal syndromes. Here, we discuss some of the major issues that have afflicted research into syndromes in macroevolution and ecology. First, correlated evolution of traits and hypothesized selective drivers is often relied on as the only evidence for adaptation of those traits to those hypothesized drivers, without supporting evidence. Second, the selective driver is often inferred from a combination of traits without explicit testing. Third, researchers often measure traits that are easy for humans to observe rather than measuring traits that are suited to testing the hypothesis of adaptation. Finally, species are often chosen for study because of their striking phenotypes, which leads to the illusion of syndromes and divergence. We argue that these issues can be avoided by combining studies of trait variation across entire clades or communities with explicit tests of adaptive hypotheses and that taking this approach will lead to a better understanding of syndrome‐like evolution and its drivers.
Collapse
Affiliation(s)
- Miranda A. Sinnott‐Armstrong
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
- Department of Chemistry University of Cambridge Cambridge UK
| | - Rocio Deanna
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
- Instituto Multidisciplinario de Biología Vegetal IMBIV (CONICET‐UNC) Córdoba Argentina
- Departamento de Ciencias Farmacéuticas Facultad de Ciencias Químicas (FCQ, UNC) Córdoba Argentina
| | - Chelsea Pretz
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
| | - Sukuan Liu
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
| | - Jesse C. Harris
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
| | - Amy Dunbar‐Wallis
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
| | - Stacey D. Smith
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
| | - Lucas C. Wheeler
- Department of Ecology and Evolutionary Biology University of Colorado‐Boulder Boulder Colorado USA
| |
Collapse
|
31
|
Mascotti ML. Resurrecting Enzymes by Ancestral Sequence Reconstruction. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2397:111-136. [PMID: 34813062 DOI: 10.1007/978-1-0716-1826-4_7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Ancestral Sequence Reconstruction (ASR) allows one to infer the sequences of extinct proteins using the phylogeny of extant proteins. It consists of disclosing the evolutionary history-i.e., the phylogeny-of a protein family of interest and then inferring the sequences of its ancestors-i.e., the nodes in the phylogeny. Assisted by gene synthesis, the selected ancestors can be resurrected in the lab and experimentally characterized. The crucial step to succeed with ASR is starting from a reliable phylogeny. At the same time, it is of the utmost importance to have a clear idea on the evolutionary history of the family under study and the events that influenced it. This allows us to implement ASR with well-defined hypotheses and to apply the appropriate experimental methods. In the last years, ASR has become popular to test hypotheses about the origin of functionalities, changes in activities, understanding physicochemical properties of proteins, among others. In this context, the aim of this chapter is to present the ASR approach applied to the reconstruction of enzymes-i.e., proteins with catalytic roles. The spirit of this contribution is to provide a basic, hands-to-work guide for biochemists and biologists who are unfamiliar with molecular phylogenetics.
Collapse
Affiliation(s)
- Maria Laura Mascotti
- Molecular Enzymology group, University of Groningen, Groningen, The Netherlands. .,IMIBIO-SL CONICET, Facultad de Química Bioquímica y Farmacia, Universidad Nacional de San Luis, San Luis, Argentina.
| |
Collapse
|
32
|
Unusual commonality in active site structural features of substrate promiscuous and specialist enzymes. J Struct Biol 2022; 214:107835. [DOI: 10.1016/j.jsb.2022.107835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 12/26/2021] [Accepted: 01/23/2022] [Indexed: 11/21/2022]
|
33
|
Lichman BR. Ancestral Sequence Reconstruction for Exploring Alkaloid Evolution. Methods Mol Biol 2022; 2505:165-179. [PMID: 35732944 DOI: 10.1007/978-1-0716-2349-7_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The complex and bioactive monoterpene indole alkaloids (MIAs) found in Catharanthus roseus and related species are the products of many millions of years of evolution through mutation and natural selection. Ancestral sequence reconstruction (ASR) is a method that combines phylogenetic analysis and experimental biochemistry to infer details about past events in protein evolution. Here, I propose that ASR could be leveraged to understand how enzymes catalyzing the formation of complex alkaloids arose over evolutionary time. I discuss the steps of ASR, including sequence selection, multiple sequence alignment, tree inference, and the generation and characterization of inferred ancestral enzymes.
Collapse
Affiliation(s)
- Benjamin R Lichman
- Centre for Novel Agricultural Products, Department of Biology, University of York, York, UK.
| |
Collapse
|
34
|
Structural dynamics in the evolution of a bilobed protein scaffold. Proc Natl Acad Sci U S A 2021; 118:2026165118. [PMID: 34845009 PMCID: PMC8694067 DOI: 10.1073/pnas.2026165118] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/20/2021] [Indexed: 11/18/2022] Open
Abstract
Proteins conduct numerous complex biological functions by use of tailored structural dynamics. The molecular details of how these emerged from ancestral peptides remains mysterious. How does nature utilize the same repertoire of folds to diversify function? To shed light on this, we analyzed bilobed proteins with a common structural core, which is spread throughout the tree of life and is involved in diverse biological functions such as transcription, enzymatic catalysis, membrane transport, and signaling. We show here that the structural dynamics of the structural core differentiate predominantly via terminal additions during a long-period evolution. This diversifies substrate specificity and, ultimately, biological function. Novel biophysical tools allow the structural dynamics of proteins and the regulation of such dynamics by binding partners to be explored in unprecedented detail. Although this has provided critical insights into protein function, the means by which structural dynamics direct protein evolution remain poorly understood. Here, we investigated how proteins with a bilobed structure, composed of two related domains from the periplasmic-binding protein–like II domain family, have undergone divergent evolution, leading to adaptation of their structural dynamics. We performed a structural analysis on ∼600 bilobed proteins with a common primordial structural core, which we complemented with biophysical studies to explore the structural dynamics of selected examples by single-molecule Förster resonance energy transfer and Hydrogen–Deuterium exchange mass spectrometry. We show that evolutionary modifications of the structural core, largely at its termini, enable distinct structural dynamics, allowing the diversification of these proteins into transcription factors, enzymes, and extracytoplasmic transport-related proteins. Structural embellishments of the core created interdomain interactions that stabilized structural states, reshaping the active site geometry, and ultimately altered substrate specificity. Our findings reveal an as-yet-unrecognized mechanism for the emergence of functional promiscuity during long periods of evolution and are applicable to a large number of domain architectures.
Collapse
|
35
|
Akanuma S, Yamaguchi M, Yamagishi A. Comprehensive mutagenesis to identify amino acid residues contributing to the difference in thermostability between two originally thermostable ancestral proteins. PLoS One 2021; 16:e0258821. [PMID: 34673819 PMCID: PMC8530338 DOI: 10.1371/journal.pone.0258821] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Accepted: 10/05/2021] [Indexed: 11/19/2022] Open
Abstract
Further improvement of the thermostability of inherently thermostable proteins is an attractive challenge because more thermostable proteins are industrially more useful and serve as better scaffolds for protein engineering. To establish guidelines that can be applied for the rational design of hyperthermostable proteins, we compared the amino acid sequences of two ancestral nucleoside diphosphate kinases, Arc1 and Bac1, reconstructed in our previous study. Although Bac1 is a thermostable protein whose unfolding temperature is around 100°C, Arc1 is much more thermostable with an unfolding temperature of 114°C. However, only 12 out of 139 amino acids are different between the two sequences. In this study, one or a combination of amino acid(s) in Bac1 was/were substituted by a residue(s) found in Arc1 at the same position(s). The best mutant, which contained three amino acid substitutions (S108D, G116A and L120P substitutions), showed an unfolding temperature more than 10°C higher than that of Bac1. Furthermore, a combination of the other nine amino acid substitutions also led to improved thermostability of Bac1, although the effects of individual substitutions were small. Therefore, not only the sum of the contributions of individual amino acids, but also the synergistic effects of multiple amino acids are deeply involved in the stability of a hyperthermostable protein. Such insights will be helpful for future rational design of hyperthermostable proteins.
Collapse
Affiliation(s)
- Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, Tokorozawa, Saitama, Japan
- * E-mail:
| | - Minako Yamaguchi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, Hachioji, Tokyo, Japan
| |
Collapse
|
36
|
Kropp C, Bruckmann A, Babinger P. Controlling Enzymatic Activity by Modulating the Oligomerization State via Chemical Rescue and Optical Control. Chembiochem 2021; 23:e202100490. [PMID: 34633135 PMCID: PMC9298306 DOI: 10.1002/cbic.202100490] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 10/08/2021] [Indexed: 12/22/2022]
Abstract
Selective switching of enzymatic activity has been a longstanding goal in synthetic biology. Drastic changes in activity upon mutational manipulation of the oligomerization state of enzymes have frequently been reported in the literature, but scarcely exploited for switching. Using geranylgeranylglyceryl phosphate synthase as a model, we demonstrate that catalytic activity can be efficiently controlled by exogenous modulation of the association state. We introduced a lysine‐to‐cysteine mutation, leading to the breakdown of the active hexamer into dimers with impaired catalytic efficiency. Addition of bromoethylamine chemically rescued the enzyme by restoring hexamerization and activity. As an alternative method, we incorporated the photosensitive unnatural amino acid o‐nitrobenzyl‐O‐tyrosine (ONBY) into the hexamerization interface. This again led to inactive dimers, but the hexameric state and activity could be recovered by UV‐light induced cleavage of ONBY. For both approaches, we obtained switching factors greater than 350‐fold, which compares favorably with previously reported activity changes that were caused by site‐directed mutagenesis.
Collapse
Affiliation(s)
- Cosimo Kropp
- Institute of Biophysics and Physical Biochemistry, Regensburg Center for Biochemistry, University of Regensburg, 93040, Regensburg, Germany
| | - Astrid Bruckmann
- Institute of Biochemistry, Genetics and Microbiology, Regensburg Center for Biochemistry, University of Regensburg, 93040, Regensburg, Germany
| | - Patrick Babinger
- Institute of Biophysics and Physical Biochemistry, Regensburg Center for Biochemistry, University of Regensburg, 93040, Regensburg, Germany
| |
Collapse
|
37
|
Pinto GP, Corbella M, Demkiv AO, Kamerlin SCL. Exploiting enzyme evolution for computational protein design. Trends Biochem Sci 2021; 47:375-389. [PMID: 34544655 DOI: 10.1016/j.tibs.2021.08.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/18/2021] [Accepted: 08/24/2021] [Indexed: 11/15/2022]
Abstract
Recent years have seen an explosion of interest in understanding the physicochemical parameters that shape enzyme evolution, as well as substantial advances in computational enzyme design. This review discusses three areas where evolutionary information can be used as part of the design process: (i) using ancestral sequence reconstruction (ASR) to generate new starting points for enzyme design efforts; (ii) learning from how nature uses conformational dynamics in enzyme evolution to mimic this process in silico; and (iii) modular design of enzymes from smaller fragments, again mimicking the process by which nature appears to create new protein folds. Using showcase examples, we highlight the importance of incorporating evolutionary information to continue to push forward the boundaries of enzyme design studies.
Collapse
Affiliation(s)
- Gaspar P Pinto
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Marina Corbella
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Andrey O Demkiv
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | | |
Collapse
|
38
|
Abstract
Some have hypothesized that ancestral proteins were, on average, less specific than their descendants. If true, this would provide a universal axis along which to organize protein evolution and suggests that reconstructed ancestral proteins may be uniquely powerful tools for protein engineering. Ancestral sequence reconstruction studies are one line of evidence used to support this hypothesis. Previously, we performed such a study, investigating the evolution of peptide-binding specificity for the paralogs S100A5 and S100A6. The modern proteins appeared more specific than their last common ancestor (ancA5/A6), as each paralog bound a subset of the peptides bound by ancA5/A6. In this study, we revisit this transition, using quantitative phage display to measure the interactions of 30,533 random peptides with human S100A5, S100A6, and ancA5/A6. This unbiased screen reveals a different picture. While S100A5 and S100A6 do indeed bind to a subset of the peptides recognized by ancA5/A6, they also acquired new peptide partners outside of the set recognized by ancA5/A6. Our previous work showed that ancA5/A6 had lower specificity than its descendants when measured against biological targets; our new work shows that ancA5/A6 has similar specificity to the modern proteins when measured against a random set of peptide targets. This demonstrates that altered biological specificity does not necessarily indicate altered intrinsic specificity, and sounds a cautionary note for using ancestral reconstruction studies with biological targets as a means to infer global evolutionary trends in specificity.
Collapse
Affiliation(s)
- Lucas C Wheeler
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA.,Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, USA
| | - Michael J Harms
- Institute of Molecular Biology, University of Oregon, Eugene, OR, USA.,Department of Chemistry and Biochemistry, University of Oregon, Eugene, OR, USA
| |
Collapse
|
39
|
Kozuka K, Nakano S, Asano Y, Ito S. Partial Consensus Design and Enhancement of Protein Function by Secondary-Structure-Guided Consensus Mutations. Biochemistry 2021; 60:2309-2319. [PMID: 34254784 DOI: 10.1021/acs.biochem.1c00309] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Consensus design (CD) is a representative sequence-based protein design method that enables the design of highly functional proteins by analyzing vast amounts of protein sequence data. This study proposes a partial consensus design (PCD) of a protein as a derivative approach of CD. The method replaces the target protein sequence with a consensus sequence in a secondary-structure-dependent manner (i.e., regionally dependent and divided into α-helix, β-sheet, and loop regions). In this study, we generated several artificial partial consensus l-threonine 3-dehydrogenases (PcTDHs) by PCD using the TDH from Cupriavidus necator (CnTDH) as a target protein. Structural and functional analysis of PcTDHs suggested that thermostability would be independently improved when consensus mutations are introduced into the loop region of TDHs. On the other hand, enzyme kinetic parameters (kcat/Km) and average productivity would be synergistically enhanced by changing the combination of the mutations-replacement of one region of CnTDH with a consensus sequence provided only negative effects, but the negative effects were nullified when the two regions were replaced simultaneously. Taken together, we propose the hypothesis that there are protein regions that encode individual protein properties, such as thermostability and activity, and that the introduction of consensus mutations into these regions could additively or synergistically modify their functions.
Collapse
Affiliation(s)
- Kohei Kozuka
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan
| | - Shogo Nakano
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan.,PREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
| | - Yasuhisa Asano
- Biotechnology Research Center and Department of Biotechnology, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama 939-0398, Japan
| | - Sohei Ito
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka, 422-8526, Japan
| |
Collapse
|
40
|
Khan RT, Musil M, Stourac J, Damborsky J, Bednar D. Fully Automated Ancestral Sequence Reconstruction using FireProt ASR. Curr Protoc 2021; 1:e30. [PMID: 33524240 DOI: 10.1002/cpz1.30] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Protein evolution and protein engineering techniques are of great interest in basic science and industrial applications such as pharmacology, medicine, or biotechnology. Ancestral sequence reconstruction (ASR) is a powerful technique for probing evolutionary relationships and engineering robust proteins with good thermostability and broad substrate specificity. The following protocol describes the setting up and execution of an automated FireProtASR workflow using a dedicated web site. The service allows for inference of ancestral proteins automatically, from a single protein sequence. Once a protein sequence is submitted, the server will build a dataset of homology sequences, perform a multiple sequence alignment (MSA), build a phylogenetic tree, and reconstruct ancestral nodes. The protocol is also highly flexible and allows for multiple forms of input, advanced settings, and the ability to start jobs from: (i) a single sequence, (ii) a set of homologous sequences, (iii) an MSA, and (iv) a phylogenetic tree. This approach automates all necessary steps and offers a way for novices with limited exposure to ASR techniques to improve the properties of a protein of interest. The technique can even be used to introduce catalytic promiscuity into an enzyme. A web server for accessing the fully automated workflow is freely accessible at https://loschmidt.chemi.muni.cz/fireprotasr/. © 2021 Wiley Periodicals LLC. Basic Protocol: ASR using the Web Server FireProtASR.
Collapse
Affiliation(s)
- Rayyan Tariq Khan
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic.,Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.,International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| |
Collapse
|
41
|
Laursen L, Čalyševa J, Gibson TJ, Jemth P. Divergent Evolution of a Protein-Protein Interaction Revealed through Ancestral Sequence Reconstruction and Resurrection. Mol Biol Evol 2021; 38:152-167. [PMID: 32750125 PMCID: PMC7782867 DOI: 10.1093/molbev/msaa198] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The postsynaptic density extends across the postsynaptic dendritic spine with discs large (DLG) as the most abundant scaffolding protein. DLG dynamically alters the structure of the postsynaptic density, thus controlling the function and distribution of specific receptors at the synapse. DLG contains three PDZ domains and one important interaction governing postsynaptic architecture is that between the PDZ3 domain from DLG and a protein called cysteine-rich interactor of PDZ3 (CRIPT). However, little is known regarding functional evolution of the PDZ3:CRIPT interaction. Here, we subjected PDZ3 and CRIPT to ancestral sequence reconstruction, resurrection, and biophysical experiments. We show that the PDZ3:CRIPT interaction is an ancient interaction, which was likely present in the last common ancestor of Eukaryotes, and that high affinity is maintained in most extant animal phyla. However, affinity is low in nematodes and insects, raising questions about the physiological function of the interaction in species from these animal groups. Our findings demonstrate how an apparently established protein–protein interaction involved in cellular scaffolding in bilaterians can suddenly be subject to dynamic evolution including possible loss of function.
Collapse
Affiliation(s)
- Louise Laursen
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Jelena Čalyševa
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.,Faculty of Biosciences, Collaboration for Joint PhD Degree between EMBL and Heidelberg University
| | - Toby J Gibson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Per Jemth
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
42
|
Spence MA, Kaczmarski JA, Saunders JW, Jackson CJ. Ancestral sequence reconstruction for protein engineers. Curr Opin Struct Biol 2021; 69:131-141. [PMID: 34023793 DOI: 10.1016/j.sbi.2021.04.001] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2020] [Revised: 03/22/2021] [Accepted: 04/07/2021] [Indexed: 12/11/2022]
Abstract
In addition to its value in the study of molecular evolution, ancestral sequence reconstruction (ASR) has emerged as a useful methodology for engineering proteins with enhanced properties. Proteins generated by ASR often exhibit unique or improved activity, stability, and/or promiscuity, all of which are properties that are valued by protein engineers. Comparison between extant proteins and evolutionary intermediates generated by ASR also allows protein engineers to identify substitutions that have contributed to functional innovation or diversification within protein families. As ASR becomes more widely adopted as a protein engineering approach, it is important to understand the applications, limitations, and recent developments of this technique. This review highlights recent exemplifications of ASR, as well as technical aspects of the reconstruction process that are relevant to protein engineering.
Collapse
Affiliation(s)
- Matthew A Spence
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Joe A Kaczmarski
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Jake W Saunders
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Colin J Jackson
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Peptide & Protein Science, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Synthetic Biology, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia.
| |
Collapse
|
43
|
Ngo K, Bruno da Silva F, Leite VBP, Contessoto VG, Onuchic JN. Improving the Thermostability of Xylanase A from Bacillus subtilis by Combining Bioinformatics and Electrostatic Interactions Optimization. J Phys Chem B 2021; 125:4359-4367. [PMID: 33887137 DOI: 10.1021/acs.jpcb.1c01253] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The rational improvement of the enzyme catalytic activity is one of the most significant challenges in biotechnology. Most conventional strategies used to engineer enzymes involve selecting mutations to increase their thermostability. Determining good criteria for choosing these substitutions continues to be a challenge. In this work, we combine bioinformatics, electrostatic analysis, and molecular dynamics to predict beneficial mutations that may improve the thermostability of XynA from Bacillus subtilis. First, the Tanford-Kirkwood surface accessibility method is used to characterize each ionizable residue contribution to the protein native state stability. Residues identified to be destabilizing were mutated with the corresponding residues determined by the consensus or ancestral sequences at the same locations. Five mutants (K99T/N151D, K99T, S31R, N151D, and K154A) were investigated and compared with 12 control mutants derived from experimental approaches from the literature. Molecular dynamics results show that the mutants exhibited folding temperatures in the order K99T > K99T/N151D > S31R > N151D > WT > K154A. The combined approaches employed provide an effective strategy for low-cost enzyme optimization needed for large-scale biotechnological and medical applications.
Collapse
Affiliation(s)
- Khoa Ngo
- Department of Physics, University of Houston, Houston, Texas 77004, United States
| | - Fernando Bruno da Silva
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas UNESP - Univ. Estadual Paulista, São José do Rio Preto, SP, Brazil
| | - Vitor B P Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas UNESP - Univ. Estadual Paulista, São José do Rio Preto, SP, Brazil
| | - Vinícius G Contessoto
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas UNESP - Univ. Estadual Paulista, São José do Rio Preto, SP, Brazil
| | | |
Collapse
|
44
|
A meta-analysis of the activity, stability, and mutational characteristics of temperature-adapted enzymes. Biosci Rep 2021; 41:228416. [PMID: 33871022 PMCID: PMC8150157 DOI: 10.1042/bsr20210336] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 03/29/2021] [Accepted: 04/19/2021] [Indexed: 11/17/2022] Open
Abstract
Understanding the characteristics that define temperature-adapted enzymes has been a major goal of extremophile enzymology in recent decades. In the present study, we explore these characteristics by comparing psychrophilic, mesophilic, and thermophilic enzymes. Through a meta-analysis of existing data, we show that psychrophilic enzymes exhibit a significantly larger gap (Tg) between their optimum and melting temperatures compared with mesophilic and thermophilic enzymes. These results suggest that Tg may be a useful indicator as to whether an enzyme is psychrophilic or not and that models of psychrophilic enzyme catalysis need to account for this gap. Additionally, by using predictive protein stability software, HoTMuSiC and PoPMuSiC, we show that the deleterious nature of amino acid substitutions to protein stability increases from psychrophiles to thermophiles. How this ultimately affects the mutational tolerance and evolutionary rate of temperature adapted organisms is currently unknown.
Collapse
|
45
|
Irumagawa S, Kobayashi K, Saito Y, Miyata T, Umetsu M, Kameda T, Arai R. Rational thermostabilisation of four-helix bundle dimeric de novo proteins. Sci Rep 2021; 11:7526. [PMID: 33824364 PMCID: PMC8024369 DOI: 10.1038/s41598-021-86952-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Accepted: 03/22/2021] [Indexed: 11/29/2022] Open
Abstract
The stability of proteins is an important factor for industrial and medical applications. Improving protein stability is one of the main subjects in protein engineering. In a previous study, we improved the stability of a four-helix bundle dimeric de novo protein (WA20) by five mutations. The stabilised mutant (H26L/G28S/N34L/V71L/E78L, SUWA) showed an extremely high denaturation midpoint temperature (Tm). Although SUWA is a remarkably hyperstable protein, in protein design and engineering, it is an attractive challenge to rationally explore more stable mutants. In this study, we predicted stabilising mutations of WA20 by in silico saturation mutagenesis and molecular dynamics simulation, and experimentally confirmed three stabilising mutations of WA20 (N22A, N22E, and H86K). The stability of a double mutant (N22A/H86K, rationally optimised WA20, ROWA) was greatly improved compared with WA20 (ΔTm = 10.6 °C). The model structures suggested that N22A enhances the stability of the α-helices and N22E and H86K contribute to salt-bridge formation for protein stabilisation. These mutations were also added to SUWA and improved its Tm. Remarkably, the most stable mutant of SUWA (N22E/H86K, rationally optimised SUWA, ROSA) showed the highest Tm (129.0 °C). These new thermostable mutants will be useful as a component of protein nanobuilding blocks to construct supramolecular protein complexes.
Collapse
Affiliation(s)
- Shin Irumagawa
- Department of Science and Technology, Graduate School of Medicine, Science and Technology, Shinshu University, Ueda, Nagano, 386-8567, Japan
- Department of Biomolecular Innovation, Institute for Biomedical Sciences, Interdisciplinary Cluster for Cutting Edge Research, Shinshu University, Matsumoto, Nagano, 390-8621, Japan
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, Ueda, Nagano, 386-8567, Japan
| | - Kaito Kobayashi
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, 135-0064, Japan
| | - Yutaka Saito
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, 135-0064, Japan
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, 169-8555, Japan
- Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, 277-8561, Japan
| | - Takeshi Miyata
- Department of Biochemistry and Biotechnology, Faculty of Agriculture, Kagoshima University, Kagoshima, 890-0065, Japan
| | - Mitsuo Umetsu
- Department of Biomolecular Engineering, Graduate School of Engineering, Tohoku University, Sendai, 980-8579, Japan
| | - Tomoshi Kameda
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, 135-0064, Japan
| | - Ryoichi Arai
- Department of Science and Technology, Graduate School of Medicine, Science and Technology, Shinshu University, Ueda, Nagano, 386-8567, Japan.
- Department of Biomolecular Innovation, Institute for Biomedical Sciences, Interdisciplinary Cluster for Cutting Edge Research, Shinshu University, Matsumoto, Nagano, 390-8621, Japan.
- Department of Applied Biology, Faculty of Textile Science and Technology, Shinshu University, Ueda, Nagano, 386-8567, Japan.
| |
Collapse
|
46
|
Wang CK, Craik DJ. Linking molecular evolution to molecular grafting. J Biol Chem 2021; 296:100425. [PMID: 33600801 PMCID: PMC8005815 DOI: 10.1016/j.jbc.2021.100425] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 02/09/2021] [Accepted: 02/13/2021] [Indexed: 12/01/2022] Open
Abstract
Molecular grafting is a strategy for the engineering of molecular scaffolds into new functional agents, such as next-generation therapeutics. Despite its wide use, studies so far have focused almost exclusively on demonstrating its utility rather than understanding the factors that lead to either poor or successful grafting outcomes. Here, we examine protein evolution and identify parallels between the natural process of protein functional diversification and the artificial process of molecular grafting. We discuss features of natural proteins that are correlated to innovability-the capacity to acquire new functions-and describe their implications to molecular grafting scaffolds. Disulfide-rich peptides are used as exemplars because they are particularly promising scaffolds onto which new functions can be grafted. This article provides a perspective on why some scaffolds are more suitable for grafting than others, identifying opportunities on how molecular grafting might be improved.
Collapse
Affiliation(s)
- Conan K Wang
- Institute for Molecular Bioscience and Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Queensland, Brisbane, Queensland, Australia.
| | - David J Craik
- Institute for Molecular Bioscience and Australian Research Council Centre of Excellence for Innovations in Peptide and Protein Science, The University of Queensland, Brisbane, Queensland, Australia
| |
Collapse
|
47
|
Crippa M, Andreghetti D, Capelli R, Tiana G. Evolution of frustrated and stabilising contacts in reconstructed ancient proteins. EUROPEAN BIOPHYSICS JOURNAL 2021; 50:699-712. [PMID: 33569610 PMCID: PMC8260555 DOI: 10.1007/s00249-021-01500-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 12/14/2020] [Accepted: 01/13/2021] [Indexed: 11/30/2022]
Abstract
Energetic properties of a protein are a major determinant of its evolutionary fitness. Using a reconstruction algorithm, dating the reconstructed proteins and calculating the interaction network between their amino acids through a coevolutionary approach, we studied how the interactions that stabilise 890 proteins, belonging to five families, evolved for billions of years. In particular, we focused our attention on the network of most strongly attractive contacts and on that of poorly optimised, frustrated contacts. Our results support the idea that the cluster of most attractive interactions extends its size along evolutionary time, but from the data, we cannot conclude that protein stability or that the degree of frustration tends always to decrease.
Collapse
Affiliation(s)
- Martina Crippa
- Department of Physics and Center for Complexity and Biosystems, Università degli Studi di Milano and INFN, via Celoria 16, 20133, Milan, Italy
- Department of Applied Science and Technology, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129, Turin, Italy
| | - Damiano Andreghetti
- Department of Physics and Center for Complexity and Biosystems, Università degli Studi di Milano and INFN, via Celoria 16, 20133, Milan, Italy
| | - Riccardo Capelli
- Department of Applied Science and Technology, Politecnico di Torino, Corso Duca degli Abruzzi 24, 10129, Turin, Italy
| | - Guido Tiana
- Department of Physics and Center for Complexity and Biosystems, Università degli Studi di Milano and INFN, via Celoria 16, 20133, Milan, Italy.
| |
Collapse
|
48
|
Perez-Garcia P, Kobus S, Gertzen CGW, Hoeppner A, Holzscheck N, Strunk CH, Huber H, Jaeger KE, Gohlke H, Kovacic F, Smits SHJ, Streit WR, Chow J. A promiscuous ancestral enzyme´s structure unveils protein variable regions of the highly diverse metallo-β-lactamase family. Commun Biol 2021; 4:132. [PMID: 33514861 PMCID: PMC7846560 DOI: 10.1038/s42003-021-01671-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Accepted: 01/06/2021] [Indexed: 01/30/2023] Open
Abstract
The metallo-β-lactamase fold is an ancient protein structure present in numerous enzyme families responsible for diverse biological processes. The crystal structure of the hyperthermostable crenarchaeal enzyme Igni18 from Ignicoccus hospitalis was solved at 2.3 Å and could resemble a possible first archetype of a multifunctional metallo-β-lactamase. Ancestral enzymes at the evolutionary origin are believed to be promiscuous all-rounders. Consistently, Igni18´s activity can be cofactor-dependently directed from β-lactamase to lactonase, lipase, phosphodiesterase, phosphotriesterase or phospholipase. Its core-domain is highly conserved within metallo-β-lactamases from Bacteria, Archaea and Eukarya and gives insights into evolution and function of enzymes from this superfamily. Structural alignments with diverse metallo-β-lactamase-fold-containing enzymes allowed the identification of Protein Variable Regions accounting for modulation of activity, specificity and oligomerization patterns. Docking of different substrates within the active sites revealed the basis for the crucial cofactor dependency of this enzyme superfamily.
Collapse
Affiliation(s)
- Pablo Perez-Garcia
- Department of Microbiology and Biotechnology, University of Hamburg, Ohnhorststrasse 18, 22609, Hamburg, Germany
| | - Stefanie Kobus
- Center for Structural Studies (CSS), Heinrich Heine University Düsseldorf, Universitätsstrasse 1, 40225, Düsseldorf, Germany
| | - Christoph G W Gertzen
- Center for Structural Studies (CSS), Heinrich Heine University Düsseldorf, Universitätsstrasse 1, 40225, Düsseldorf, Germany
| | - Astrid Hoeppner
- Center for Structural Studies (CSS), Heinrich Heine University Düsseldorf, Universitätsstrasse 1, 40225, Düsseldorf, Germany
| | - Nicholas Holzscheck
- Department of Microbiology and Biotechnology, University of Hamburg, Ohnhorststrasse 18, 22609, Hamburg, Germany
| | - Christoph Heinrich Strunk
- Institute of Molecular Enzyme Technology (IMET), Heinrich Heine University Düsseldorf, 52426, Jülich, Germany
| | - Harald Huber
- Institute for Microbiology and Archaeal Center, Regensburg University, 93035, Regensburg, Germany
| | - Karl-Erich Jaeger
- Institute of Molecular Enzyme Technology (IMET), Heinrich Heine University Düsseldorf, 52426, Jülich, Germany
- Institute of Bio- and Geosciences IBG-1: Biotechnology, Forschungszentrum Jülich GmbH, 52426, Jülich, Germany
| | - Holger Gohlke
- John von Neumann Institute for Computing (NIC), Jülich Supercomputing Centre (JSC) & Institute of Biological Information Processing (IBI-7: Structural Biochemistry), Forschungszentrum Jülich GmbH, 52425, Jülich, Germany
- Institute for Pharmaceutical and Medicinal Chemistry, Heinrich Heine University Düsseldorf, 40225, Düsseldorf, Germany
| | - Filip Kovacic
- Institute of Molecular Enzyme Technology (IMET), Heinrich Heine University Düsseldorf, 52426, Jülich, Germany
| | - Sander H J Smits
- Center for Structural Studies (CSS), Heinrich Heine University Düsseldorf, Universitätsstrasse 1, 40225, Düsseldorf, Germany
- Institute of Biochemistry, Heinrich Heine University Düsseldorf, 40225, Düsseldorf, Germany
| | - Wolfgang R Streit
- Department of Microbiology and Biotechnology, University of Hamburg, Ohnhorststrasse 18, 22609, Hamburg, Germany
| | - Jennifer Chow
- Department of Microbiology and Biotechnology, University of Hamburg, Ohnhorststrasse 18, 22609, Hamburg, Germany.
| |
Collapse
|
49
|
Musil M, Khan RT, Beier A, Stourac J, Konegger H, Damborsky J, Bednar D. FireProtASR: A Web Server for Fully Automated Ancestral Sequence Reconstruction. Brief Bioinform 2020; 22:6042664. [PMID: 33346815 PMCID: PMC8294521 DOI: 10.1093/bib/bbaa337] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 10/12/2020] [Indexed: 12/13/2022] Open
Abstract
There is a great interest in increasing proteins’ stability to widen their usability in numerous biomedical and biotechnological applications. However, native proteins cannot usually withstand the harsh industrial environment, since they are evolved to function under mild conditions. Ancestral sequence reconstruction is a well-established method for deducing the evolutionary history of genes. Besides its applicability to discover the most probable evolutionary ancestors of the modern proteins, ancestral sequence reconstruction has proven to be a useful approach for the design of highly stable proteins. Recently, several computational tools were developed, which make the ancestral reconstruction algorithms accessible to the community, while leaving the most crucial steps of the preparation of the input data on users’ side. FireProtASR aims to overcome this obstacle by constructing a fully automated workflow, allowing even the unexperienced users to obtain ancestral sequences based on a sequence query as the only input. FireProtASR is complemented with an interactive, easy-to-use web interface and is freely available at https://loschmidt.chemi.muni.cz/fireprotasr/.
Collapse
Affiliation(s)
| | | | - Andy Beier
- Loschmidt Laboratories, Masaryk University
| | | | | | - Jiri Damborsky
- International Clinical Research Center at St. Ann's Teaching Hospital
| | | |
Collapse
|
50
|
Motoyama T, Hiramatsu N, Asano Y, Nakano S, Ito S. Protein Sequence Selection Method That Enables Full Consensus Design of Artificial l-Threonine 3-Dehydrogenases with Unique Enzymatic Properties. Biochemistry 2020; 59:3823-3833. [PMID: 32945652 DOI: 10.1021/acs.biochem.0c00570] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Exponentially increasing protein sequence data enables artificial enzyme design using sequence-based protein design methods, including full-consensus protein design (FCD). The success of artificial enzyme design is strongly dependent on the nature of the sequences used. Hence, sequences must be selected from databases and curated libraries prepared to enable a successful design by FCD. In this study, we proposed a selection approach regarding several key residues as sequence motifs. We used l-threonine 3-dehydrogenase (TDH) as a model to test the validity of this approach. In the classification, four residues (143, 174, 188, and 214) were used as key residues. We classified thousands of TDH homologous sequences into five groups containing hundreds of sequences. Utilizing sequences in the libraries, we designed five artificial TDHs by FCD. Among the five, we successfully expressed four in soluble form. Biochemical analysis of artificial TDHs indicated that their enzymatic properties vary; half of the maximum measured enzyme activity (t1/2) and activation energies were distributed from 53 to 65 °C and from 38 to 125 kJ/mol, respectively. The artificial TDHs had unique kinetic parameters, distinct from one another. Structural analysis indicates that consensus mutations are mainly introduced in the secondary or outer shell. The functional diversity of the artificial TDHs is due to the accumulation of mutations that affect their physicochemical properties. Taken together, our findings indicate that our proposed approach can help generate artificial enzymes with unique enzymatic properties.
Collapse
Affiliation(s)
- Tomoharu Motoyama
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Nozomi Hiramatsu
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Yasuhisa Asano
- Biotechnology Research Center and Department of Biotechnology, Toyama Prefectural University, 5180 Kurokawa, Imizu, Toyama 939-0398, Japan
| | - Shogo Nakano
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| | - Sohei Ito
- Graduate School of Integrated Pharmaceutical and Nutritional Sciences, University of Shizuoka, 52-1 Yada, Suruga-ku, Shizuoka 422-8526, Japan
| |
Collapse
|