1
|
Kurahashi R, Tanaka SI, Takano K. Highly active enzymes produced by directed evolution with stability-based selection. Enzyme Microb Technol 2020; 140:109626. [DOI: 10.1016/j.enzmictec.2020.109626] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 06/12/2020] [Accepted: 06/12/2020] [Indexed: 12/22/2022]
|
2
|
Activity-stability trade-off in random mutant proteins. J Biosci Bioeng 2019; 128:405-409. [DOI: 10.1016/j.jbiosc.2019.03.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 03/18/2019] [Accepted: 03/27/2019] [Indexed: 12/23/2022]
|
3
|
Finch AJ, Kim JR. Thermophilic Proteins as Versatile Scaffolds for Protein Engineering. Microorganisms 2018; 6:microorganisms6040097. [PMID: 30257429 PMCID: PMC6313779 DOI: 10.3390/microorganisms6040097] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2018] [Revised: 09/23/2018] [Accepted: 09/23/2018] [Indexed: 01/18/2023] Open
Abstract
Literature from the past two decades has outlined the existence of a trade-off between protein stability and function. This trade-off creates a unique challenge for protein engineers who seek to introduce new functionality to proteins. These engineers must carefully balance the mutation-mediated creation and/or optimization of function with the destabilizing effect of those mutations. Subsequent research has shown that protein stability is positively correlated with "evolvability" or the ability to support mutations which bestow new functionality on the protein. Since the ultimate goal of protein engineering is to create and/or optimize a protein's function, highly stable proteins are preferred as potential scaffolds for protein engineering. This review focuses on the application potential for thermophilic proteins as scaffolds for protein engineering. The relatively high inherent thermostability of these proteins grants them a great deal of mutational robustness, making them promising scaffolds for various protein engineering applications. Comparative studies on the evolvability of thermophilic and mesophilic proteins have strongly supported the argument that thermophilic proteins are more evolvable than mesophilic proteins. These findings indicate that thermophilic proteins may represent the scaffold of choice for protein engineering in the future.
Collapse
Affiliation(s)
- Anthony J Finch
- Department of Chemical and Biomolecular Engineering, New York University, 6 MetroTech Center, Brooklyn, NY 11201, USA.
| | - Jin Ryoun Kim
- Department of Chemical and Biomolecular Engineering, New York University, 6 MetroTech Center, Brooklyn, NY 11201, USA.
| |
Collapse
|
4
|
The direction of protein evolution is destined by the stability. Biochimie 2018; 150:100-109. [DOI: 10.1016/j.biochi.2018.05.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 05/15/2018] [Indexed: 01/29/2023]
|
5
|
Protein Evolution is Potentially Governed by Protein Stability: Directed Evolution of an Esterase from the Hyperthermophilic Archaeon Sulfolobus tokodaii. J Mol Evol 2018; 86:283-292. [DOI: 10.1007/s00239-018-9843-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 04/18/2018] [Indexed: 11/27/2022]
|
6
|
Shah V, Kim JR. Transposon for protein engineering. Mob Genet Elements 2016; 6:e1239601. [PMID: 28090378 DOI: 10.1080/2159256x.2016.1239601] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2016] [Accepted: 09/14/2016] [Indexed: 12/21/2022] Open
Abstract
Protein insertional fusion and circular permutation are 2 promising protein engineering techniques for creating integrated functionalities and sequence diversity of a protein, respectively. Finding insertion locations for protein insertional fusion and new termini for circular permutation through a rational approach is not always straightforward, especially, for proteins without detailed structural knowledge. On the contrary, a combinatorial approach facilitates a comprehensive search to evaluate all potential insertion sites and new termini locations. Conventional methods used to create random insertional fusion libraries generate sub-optimal inter-domain linker length and composition between fused proteins. There are also methods available for construction of random circular permutation libraries. However, these methods too, impose many drawbacks, such as significant sequence modification at the new termini of circular permutants and additionally, require re-design of transposons for tailored expression of circular permutants. Furthermore, these conventional methods employ relatively inefficient blunt-end ligation during library construction. In this commentary, we present a concise overview and key findings of engineered Mu transposons, which have recently been developed in our group as a facile and efficient tool to alleviate limitations realized from conventional methods and to construct high quality libraries for random insertional fusion and random circular permutation.
Collapse
Affiliation(s)
- Vandan Shah
- Othmer-Jacobs Department of Chemical and Biomolecular Engineering, New York University , Brooklyn, NY, USA
| | - Jin Ryoun Kim
- Othmer-Jacobs Department of Chemical and Biomolecular Engineering, New York University , Brooklyn, NY, USA
| |
Collapse
|
7
|
Shahzad K, Mittenthal JE, Caetano-Anollés G. The organization of domains in proteins obeys Menzerath-Altmann's law of language. BMC SYSTEMS BIOLOGY 2015; 9:44. [PMID: 26260760 PMCID: PMC4531524 DOI: 10.1186/s12918-015-0192-9] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 07/30/2015] [Indexed: 11/10/2022]
Abstract
BACKGROUND The combination of domains in multidomain proteins enhances their function and structure but lengthens the molecules and increases their cost at cellular level. METHODS The dependence of domain length on the number of domains a protein holds was surveyed for a set of 60 proteomes representing free-living organisms from all kingdoms of life. Distributions were fitted using non-linear functions and fitted parameters interpreted with a formulation of decreasing returns. RESULTS We find that domain length decreases with increasing number of domains in proteins, following the Menzerath-Altmann (MA) law of language. Highly significant negative correlations exist for the set of proteomes examined. Mathematically, the MA law expresses as a power law relationship that unfolds when molecular persistence P is a function of domain accretion. P holds two terms, one reflecting the matter-energy cost of adding domains and extending their length, the other reflecting how domain length and number impinges on information and biophysics. The pattern of diminishing returns can therefore be explained as a frustrated interplay between the strategies of economy, flexibility and robustness, matching previously observed trade-offs in the domain makeup of proteomes. Proteomes of Archaea, Fungi and to a lesser degree Plants show the largest push towards molecular economy, each at their own economic stratum. Fungi increase domain size in single domain proteins while reinforcing the pattern of diminishing returns. In contrast, Metazoa, and to lesser degrees Protista and Bacteria, relax economy. Metazoa achieves maximum flexibility and robustness by harboring compact molecules and complex domain organization, offering a new functional vocabulary for molecular biology. CONCLUSIONS The tendency of parts to decrease their size when systems enlarge is universal for language and music, and now for parts of macromolecules, extending the MA law to natural systems.
Collapse
Affiliation(s)
| | - Jay E Mittenthal
- Department of Cell and Developmental Biology, Urbana, IL, 61801, USA.
| | - Gustavo Caetano-Anollés
- Illinois Informatics Institute, Urbana, IL, 61801, USA. .,Department of Crop Sciences, Evolutionary Bioinformatics Laboratory, University of Illinois, 332 NSRC, Urbana, IL, 61801, USA.
| |
Collapse
|
8
|
Egel R. Origins and emergent evolution of life: the colloid microsphere hypothesis revisited. ORIGINS LIFE EVOL B 2014; 44:87-110. [PMID: 25208738 DOI: 10.1007/s11084-014-9363-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2014] [Accepted: 08/14/2014] [Indexed: 11/28/2022]
Abstract
Self-replicating molecules, in particular RNA, have long been assumed as key to origins of life on Earth. This notion, however, is not very secure since the reduction of life's complexity to self-replication alone relies on thermodynamically untenable assumptions. Alternative, earlier hypotheses about peptide-dominated colloid self-assembly should be revived. Such macromolecular conglomerates presumably existed in a dynamic equilibrium between confluent growth in sessile films and microspheres detached in turbulent suspension. The first organic syntheses may have been driven by mineral-assisted photoactivation at terrestrial geothermal fields, allowing photo-dependent heterotrophic origins of life. Inherently endowed with rudimentary catalyst activities, mineral-associated organic microstructures can have evolved adaptively toward cooperative 'protolife' communities, in which 'protoplasmic continuity' was maintained throughout a graded series of 'proto-biofilms', 'protoorganisms' and 'protocells' toward modern life. The proneness of organic microspheres to merge back into the bulk of sessile films by spontaneous fusion can have made large populations promiscuous from the beginning, which was important for the speed of collective evolution early on. In this protein-centered scenario, the emergent coevolution of uncoded peptides, metabolic cofactors and oligoribonucleotides was primarily optimized for system-supporting catalytic capabilities arising from nonribosomal peptide synthesis and nonreplicative ribonucleotide polymerization, which in turn incorporated other reactive micromolecular organics as vitamins and cofactors into composite macromolecular colloid films and microspheres. Template-dependent replication and gene-encoded protein synthesis emerged as secondary means for further optimization of overall efficieny later on. Eventually, Darwinian speciation of cell-like lineages commenced after minimal gene sets had been bundled in transmissible genomes from multigenomic protoorganisms.
Collapse
Affiliation(s)
- Richard Egel
- Department of Biology, University of Copenhagen Biocenter, Ole Maaløes Vej 5, DK-2200, Copenhagen, Denmark,
| |
Collapse
|
9
|
Raghunathan G, Soundrarajan N, Sokalingam S, Yun H, Lee SG. Deletional protein engineering based on stable fold. PLoS One 2012; 7:e51510. [PMID: 23240034 PMCID: PMC3519881 DOI: 10.1371/journal.pone.0051510] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Accepted: 11/01/2012] [Indexed: 12/14/2022] Open
Abstract
Diversification of protein sequence-structure space is a major concern in protein engineering. Deletion mutagenesis can generate a protein sequence-structure space different from substitution mutagenesis mediated space, but it has not been widely used in protein engineering compared to substitution mutagenesis, because it causes a relatively huge range of structural perturbations of target proteins which often inactivates the proteins. In this study, we demonstrate that, using green fluorescent protein (GFP) as a model system, the drawback of the deletional protein engineering can be overcome by employing the protein structure with high stability. The systematic dissection of N-terminal, C-terminal and internal sequences of GFPs with two different stabilities showed that GFP with high stability (s-GFP), was more tolerant to the elimination of amino acids compared to a GFP with normal stability (n-GFP). The deletion studies of s-GFP enabled us to achieve three interesting variants viz. s-DL4, s-N14, and s-C225, which could not been obtained from n-GFP. The deletion of 191–196 loop sequences led to the variant s-DL4 that was expressed predominantly as insoluble form but mostly active. The s-N14 and s-C225 are the variants without the amino acid residues involving secondary structures around N- and C-terminals of GFP fold respectively, exhibiting comparable biophysical properties of the n-GFP. Structural analysis of the variants through computational modeling study gave a few structural insights that can explain the spectral properties of the variants. Our study suggests that the protein sequence-structure space of deletion mutants can be more efficiently explored by employing the protein structure with higher stability.
Collapse
Affiliation(s)
- Govindan Raghunathan
- Department of Chemical Engineering, Pusan National University, Busan, South Korea
| | | | - Sriram Sokalingam
- Department of Chemical Engineering, Pusan National University, Busan, South Korea
| | - Hyungdon Yun
- School of Biotechnology, Yeungnam University, Gyeongsan, South Korea
| | - Sun-Gu Lee
- Department of Chemical Engineering, Pusan National University, Busan, South Korea
- * E-mail:
| |
Collapse
|
10
|
Mittenthal J, Caetano-Anollés D, Caetano-Anollés G. Biphasic patterns of diversification and the emergence of modules. Front Genet 2012; 3:147. [PMID: 22891076 PMCID: PMC3413098 DOI: 10.3389/fgene.2012.00147] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2012] [Accepted: 07/19/2012] [Indexed: 01/08/2023] Open
Abstract
The intricate molecular and cellular structure of organisms converts energy to work, which builds and maintains structure. Evolving structure implements modules, in which parts are tightly linked. Each module performs characteristic functions. In this work we propose that a module can emerge through two phases of diversification of parts. Early in the first phase of this biphasic pattern, the parts have weak linkage-they interact weakly and associate variously. The parts diversify and compete. Under selection for performance, interactions among the parts increasingly constrain their structure and associations. As many variants are eliminated, parts self-organize into modules with tight linkage. Linkage may increase in response to exogenous stresses as well as endogenous processes. In the second phase of diversification, variants of the module and its functions evolve and become new parts for a new cycle of generation of higher-level modules. This linkage hypothesis can interpret biphasic patterns in the diversification of protein domain structure, RNA and protein shapes, and networks in metabolism, codes, and embryos, and can explain hierarchical levels of structural organization that are widespread in biology.
Collapse
Affiliation(s)
- Jay Mittenthal
- Department of Cell and Developmental Biology, University of IllinoisUrbana-Champaign, IL, USA
- Institute for Genomic Biology, University of IllinoisUrbana-Champaign, IL, USA
| | - Derek Caetano-Anollés
- Department of Cell and Developmental Biology, University of IllinoisUrbana-Champaign, IL, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of IllinoisUrbana, IL, USA
- Institute for Genomic Biology, University of IllinoisUrbana-Champaign, IL, USA
| |
Collapse
|
11
|
Harish A, Caetano-Anollés G. Ribosomal history reveals origins of modern protein synthesis. PLoS One 2012; 7:e32776. [PMID: 22427882 PMCID: PMC3299690 DOI: 10.1371/journal.pone.0032776] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2011] [Accepted: 01/30/2012] [Indexed: 02/06/2023] Open
Abstract
The origin and evolution of the ribosome is central to our understanding of the cellular world. Most hypotheses posit that the ribosome originated in the peptidyl transferase center of the large ribosomal subunit. However, these proposals do not link protein synthesis to RNA recognition and do not use a phylogenetic comparative framework to study ribosomal evolution. Here we infer evolution of the structural components of the ribosome. Phylogenetic methods widely used in morphometrics are applied directly to RNA structures of thousands of molecules and to a census of protein structures in hundreds of genomes. We find that components of the small subunit involved in ribosomal processivity evolved earlier than the catalytic peptidyl transferase center responsible for protein synthesis. Remarkably, subunit RNA and proteins coevolved, starting with interactions between the oldest proteins (S12 and S17) and the oldest substructure (the ribosomal ratchet) in the small subunit and ending with the rise of a modern multi-subunit ribosome. Ancestral ribonucleoprotein components show similarities to in vitro evolved RNA replicase ribozymes and protein structures in extant replication machinery. Our study therefore provides important clues about the chicken-or-egg dilemma associated with the central dogma of molecular biology by showing that ribosomal history is driven by the gradual structural accretion of protein and RNA structures. Most importantly, results suggest that functionally important and conserved regions of the ribosome were recruited and could be relics of an ancient ribonucleoprotein world.
Collapse
Affiliation(s)
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana-Champaign, Illinois, United States of America
| |
Collapse
|
12
|
The phylogenomic roots of modern biochemistry: origins of proteins, cofactors and protein biosynthesis. J Mol Evol 2012; 74:1-34. [PMID: 22210458 DOI: 10.1007/s00239-011-9480-1] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2011] [Accepted: 12/12/2011] [Indexed: 12/20/2022]
Abstract
The complexity of modern biochemistry developed gradually on early Earth as new molecules and structures populated the emerging cellular systems. Here, we generate a historical account of the gradual discovery of primordial proteins, cofactors, and molecular functions using phylogenomic information in the sequence of 420 genomes. We focus on structural and functional annotations of the 54 most ancient protein domains. We show how primordial functions are linked to folded structures and how their interaction with cofactors expanded the functional repertoire. We also reveal protocell membranes played a crucial role in early protein evolution and show translation started with RNA and thioester cofactor-mediated aminoacylation. Our findings allow elaboration of an evolutionary model of early biochemistry that is firmly grounded in phylogenomic information and biochemical, biophysical, and structural knowledge. The model describes how primordial α-helical bundles stabilized membranes, how these were decorated by layered arrangements of β-sheets and α-helices, and how these arrangements became globular. Ancient forms of aminoacyl-tRNA synthetase (aaRS) catalytic domains and ancient non-ribosomal protein synthetase (NRPS) modules gave rise to primordial protein synthesis and the ability to generate a code for specificity in their active sites. These structures diversified producing cofactor-binding molecular switches and barrel structures. Accretion of domains and molecules gave rise to modern aaRSs, NRPS, and ribosomal ensembles, first organized around novel emerging cofactors (tRNA and carrier proteins) and then more complex cofactor structures (rRNA). The model explains how the generation of protein structures acted as scaffold for nucleic acids and resulted in crystallization of modern translation.
Collapse
|
13
|
Seufferheld MJ, Kim KM, Whitfield J, Valerio A, Caetano-Anollés G. Evolution of vacuolar proton pyrophosphatase domains and volutin granules: clues into the early evolutionary origin of the acidocalcisome. Biol Direct 2011; 6:50. [PMID: 21974828 PMCID: PMC3198990 DOI: 10.1186/1745-6150-6-50] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2011] [Accepted: 10/05/2011] [Indexed: 01/08/2023] Open
Abstract
Background Volutin granules appear to be universally distributed and are morphologically and chemically identical to acidocalcisomes, which are electron-dense granular organelles rich in calcium and phosphate, whose functions include storage of phosphorus and various metal ions, metabolism of polyphosphate, maintenance of intracellular pH, osmoregulation and calcium homeostasis. Prokaryotes are thought to differ from eukaryotes in that they lack membrane-bounded organelles. However, it has been demonstrated that as in acidocalcisomes, the calcium and polyphosphate-rich intracellular "volutin granules (polyphosphate bodies)" in two bacterial species, Agrobacterium tumefaciens, and Rhodospirillum rubrum, are membrane bound and that the vacuolar proton-translocating pyrophosphatases (V-H+PPases) are present in their surrounding membranes. Volutin granules and acidocalcisomes have been found in organisms as diverse as bacteria and humans. Results Here, we show volutin granules also occur in Archaea and are, therefore, present in the three superkingdoms of life (Archaea, Bacteria and Eukarya). Molecular analyses of V-H+PPase pumps, which acidify the acidocalcisome lumen and are diagnostic proteins of the organelle, also reveal the presence of this enzyme in all three superkingdoms suggesting it is ancient and universal. Since V-H+PPase sequences contained limited phylogenetic signal to fully resolve the ancestral nodes of the tree, we investigated the divergence of protein domains in the V-H+PPase molecules. Using Protein family (Pfam) database, we found a domain in the protein, PF03030. The domain is shared by 31 species in Eukarya, 231 in Bacteria, and 17 in Archaea. The universal distribution of the V-H+PPase PF03030 domain, which is associated with the V-H+PPase function, suggests the domain and the enzyme were already present in the Last Universal Common Ancestor (LUCA). Conclusion The importance of the V-H+PPase function and the evolutionary dynamics of these domains support the early origin of the acidocalcisome organelle. In particular, the universality of volutin granules and presence of a functional V-H+PPase domain in the three superkingdoms of life reveals that the acidocalcisomes may have appeared earlier than the divergence of the superkingdoms. This result is remarkable and highlights the possibility that a high degree of cellular compartmentalization could already have been present in the LUCA. Reviewers This article was reviewed by Anthony Poole, Lakshminarayan Iyer and Daniel Kahn
Collapse
Affiliation(s)
- Manfredo J Seufferheld
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| | | | | | | | | |
Collapse
|
14
|
Wang M, Jiang YY, Kim KM, Qu G, Ji HF, Mittenthal JE, Zhang HY, Caetano-Anollés G. A universal molecular clock of protein folds and its power in tracing the early history of aerobic metabolism and planet oxygenation. Mol Biol Evol 2010; 28:567-82. [PMID: 20805191 DOI: 10.1093/molbev/msq232] [Citation(s) in RCA: 97] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The standard molecular clock describes a constant rate of molecular evolution and provides a powerful framework for evolutionary timescales. Here, we describe the existence and implications of a molecular clock of folds, a universal recurrence in the discovery of new structures in the world of proteins. Using a phylogenomic structural census in hundreds of proteomes, we build phylogenies and time lines of domains at fold and fold superfamily levels of structural complexity. These time lines correlate approximately linearly with geological timescales and were here used to date two crucial events in life history, planet oxygenation and organism diversification. We first dissected the structures and functions of enzymes in simulated metabolic networks. The placement of anaerobic and aerobic enzymes in the time line revealed that aerobic metabolism emerged about 2.9 billion years (giga-annum; Ga) ago and expanded during a period of about 400 My, reaching what is known as the Great Oxidation Event. During this period, enzymes recruited old and new folds for oxygen-mediated enzymatic activities. Remarkably, the first fold lost by a superkingdom disappeared in Archaea 2.6 Ga ago, within the span of oxygen rise, suggesting that oxygen also triggered diversification of life. The implications of a molecular clock of folds are many and important for the neutral theory of molecular evolution and for understanding the growth and diversity of the protein world. The clock also extends the standard concept that was specific to molecules and their timescales and turns it into a universal timescale-generating tool.
Collapse
Affiliation(s)
- Minglei Wang
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana-Champaign, USA
| | | | | | | | | | | | | | | |
Collapse
|